Changelogs » Determined

PyUp Safety actively tracks 362,670 Python packages for vulnerabilities and notifies you when to upgrade.

Determined

0.17.1

262a4ccc docs: add release notes for 0.17.1 (3091)
  a0fdf9dd fix: write cluster_info.json in all non-cmd task types (3094)

0.17.1rc3

6c5c02d8 fix: avoid race on schema cache load (3081) [DET-6108]
  02308ca0 fix: report progress correctly for searchers configured in epochs (3084) [DET-6112]

0.17.1rc2

114f5c9c fix: update harness to handle telemetry being off (3085)

0.17.1rc1

f7f5860c chore: add redirect to documents (3054)
  e102ac8a chore: upgrade sphinx version (3077)
  f9ed8da6 chore: update experiment and checkpoint imports for consistency (3079)
  f05c32b6 fix: fix an issue in some CLI aliases not working (3078)
  0ea9b0e1 fix: update `helm push` command to `helm cm-push` (3075)

0.17.1rc0

9b977b92 chore: lock api state for backward compatibility check
  6689e625 fix: mispelling [DET-6095] (3073)
  8419fd1a chore: remove flaky tests (3069)
  0e616f2e chore: speed up cli startup time (3061)
  f536460a feat: add Notes tab on experiment pages [DET-4691] (3048)
  cadb2f66 test: stop trying to close modal twice in a row (3067)
  c0e17570 fix: always `mkdir` default mounted `checkpoint_storage` `host_path`. (3065)
  c1aa0880 chore: rename cpu containers to aux (3056)

0.17.1.dev0

ff4df832 docs: add release notes for 0.17.0 (3024)
  0fda11fd ci: don't depend on badssl.com for test_custom_tls (3062)
  b916abf6 ci: update gke version (3051)
  693ded3b feat: run db migrations in transactions [DET-5987] (3025)
  6b4ff187 chore: environment bump analytics-python (3057)
  abb3250c feat: add segment tracking python package to harness (3053)
  bda42c52 test: update experiment row kill to handle modal confirmation (3055)
  94fdc507 chore: remove deprecated io-ts any type (3045)
  51453e44 chore: tweak samples_per_second metric to represent all workers (3050)
  68d9aca6 docs: reorganize the document structure (3034)
  ca594d0b fix: gracefully handle prestart agent failures (3049)
  2a956a1f chore: remove NativeContext and simplify Context inheritance (3044)
  cc26061e Revert "test mmdetection on p3.8xlarge"
  1a3036d0 test mmdetection on p3.8xlarge
  e0129f52 fix: Make agent names unique for det deploy local agent-up (3038)
  f0273d1c chore: add server-side portion of external session handling (3016)
  3b1df0ca feat: introduce ClusterInfo API (2946)
  85aabd3b chore: added confirmation modal to task kill [DET-6049] (3035)
  ac31bac5 chore: adding markdown component (3033)
  b21d88ee fix: use str for FileLock (3036)
  50eb8f7b chore: rewrite schemas package without typing internals (3029)
  12f7427b feat: cross-compile for powerpc64 (2828)
  9012a785 chore: prefer https to ssh for git dep (3028)
  5058c60e chore: update release note guidelines (3027)
  aa0252f0 fix: fix nested hparams with grid (3021)
  7b9fd713 test: fix flake from race in idle watcher tests (3008)
  e0e84bf2 chore: mark open allocs as closed on restart (3019)
  b7f1c3c9 chore: upgrade to Go 1.17 (3015)
  87e791fc chore: lower e2e-webui resource class (2935)
  7f40dabe fix: propagate podspec to gc (3012)
  b8afaa4d include load fast flag in 2.6 (3007)
  18986727 feat: allow kubernetes to use priority from exp config (2956)
  6fc5e7e4 refactor: move app queries and migrations out of `internal/db/postgres`. (3014)
  2709f56a chore: handle query param jwt for external auth (2992)
  22fcb05d chore: clean notebook readme (3013)
  4d1734c8 fix: notebook idle check use master port, cert [DET-6013] (3010)
  54e3e567 chore: menu items require keys for newer antd versions (4.x+) (3011)
  3dfcc11a chore: update package json [DET-5846] (2982)
  e5caf86f chore: recover agent websocket flakes [DET-5935] (2991)
  888eb5da feat: add CPU images for TF 2.5 and 2.6 [DET-5877] (2981)
  a3886178 chore: minor copy fix (3009)
  44ebc982 fix: save task end times (3006) [DET-6028]
  cf68f173 chore: nit command exit message placeholder (3003)
  d3595e3d chore: add a few timings metrics when sync_required == True (2996)
  a0107855 fix: Inline editor for experiment description truncates placeholder text [DET-6024]
  70a089b4 fix: kill trial should send kill (3001) [DET-6026]
  8d7a4519 fix: `det t describe --metrics` API and rendering [DET-6025] (3002)
  76c17f32 fix: bug in notebook README (3004)
  9f4e2702 fix: remove refs to workload start_time from det e describe (2995)
  32c41e12 fix: notebook wait page updates for API changes (2998)
  c9b16e58 fix: rename trial job type to experiment (2997)
  e1f79257 chore: fix nil deref in idle timeout watcher (2993)
  24043e6e fix: always `helm push` latest version (2994)
  
  
  
  Docker images
  
  - `docker pull determinedai/determined-master:latest`
  - `docker pull determinedai/determined-master:0.17.1`
  - `docker pull determinedai/determined-master:a2ac78ba`
  - `docker pull determinedai/determined-master:a2ac78ba1ecf397a2a156c9b9b3ed3bee057899d`
  - `docker pull determinedai/determined-dev:determined-master-a2ac78ba`
  - `docker pull determinedai/determined-dev:determined-master-a2ac78ba1ecf397a2a156c9b9b3ed3bee057899d`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:0.17.1`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:a2ac78ba`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:a2ac78ba1ecf397a2a156c9b9b3ed3bee057899d`

0.17.0

954f2972 docs: add release notes for 0.17.0 (3024)

0.17.0rc4

b12cb244 chore: mark open allocs as closed on restart (3019)
  aebd71e5 chore: clean notebook readme (3013)
  2b5be1eb fix: propagate podspec to gc (3012)
  b0613823 feat: allow kubernetes to use priority from exp config (2956)

0.17.0rc3

9f0d23a2 fix: notebook idle check use master port, cert [DET-6013] (3010)

0.17.0rc2

826f8478 fix: save task end times (3006) [DET-6028]
  ed107fd2 chore: nit command exit message placeholder (3003)
  92d63f63 chore: add a few timings metrics when sync_required == True (2996)
  701458e2 fix: remove refs to workload start_time from det e describe (2995)
  79f4cd83 chore: recover agent websocket flakes [DET-5935] (2991)
  53beb628 feat: add CPU images for TF 2.5 and 2.6 [DET-5877] (2981)

0.17.0rc1

598d7f4b fix: notebook wait page updates for API changes (2998)
  be56db25 chore: fix nil deref in idle timeout watcher (2993)
  3930ced4 fix: `det t describe --metrics` API and rendering [DET-6025] (3002)
  7dcd4f6a fix: kill trial should send kill (3001) [DET-6026]
  f0cf24d2 fix: bug in notebook README (3004)
  3d878ebd fix: always `helm push` latest version (2994)

0.17.0rc0

070dd4ba chore: lock api state for backward compatibility check
  0b65b7ba feat: det deploy local: remove support for --auto-bind-mount [DET-5948] (2932)
  b15b6894 fix: tell mypy to ignore azure (2990)
  bcd959c8 fix: update cuda for fake tests (2983)
  a0db48ea Add support for float16 serialization (2915)
  d482f252 fix: address CVEs in agent & master docker images. (2989)
  66b55b72 chore: update notebook README [DET-6001] (2985)
  97262808 fix: implement boto3 wrapper to allow refreshable credentials [DET-5690] (2957)
  bea93411 chore: StorageManagers operate on uuids, not checkpoint manifests (2970)
  5542a0a7 chore: confirm with users when running det deploy aws down [DET-6000] (2984)
  8c37d5b2 chore: update task log response shape (2986)
  576b51f7  feat: support configuring working directory for tasks [DET-5009] (2773)
  1825c546 fix: make PIDServer send SIGKILL after waiting on SIGTERM (2976)
  64bf390a chore: unify task types [DET-5950, DET-5955] (2938)
  8d98692d chore: popout new tab when clicking on task list links [DET-5998] (2979)
  100f9b10 feat: allow experiment owner to delete their own experiments [DET-5989] (2977)
  36ba4c35 chore: use mock library in doc building (2968)
  086f0db3 chore: remove -r option since default macos ln doesn't support it (2971)
  5a43b3ba chore: remove start_time from get_checkpoints_for_trial (2975)
  8f22270c feat: remove start_time from all workload types [DET-5979] (2912)
  483a24b1 chore: add STEP_WITH_OPTIMIZER setting for lr scheduler (2960)
  fbb3294d chore: rework tensorboard and checkpoint gc paths (2948)
  2c4f97e0 chore: fix returning nil error (2972)
  557ea3ab chore: update GET raw allocation to account for loss of workload information [DET-5973] (2911)
  722b89f1 fix: model-hub mmdetection logging (2964)
  28cea50a chore: restore saml auth file to match ee version (2967)
  451f5873 refactor: move ee to oss [DET-5937] (2963)
  0eb13903 fix: update logic on when query url should be overridden (2942)
  c4e97439 chore: add support for batch delete of experiments [DET-5224] (2958)
  1f4315db chore: remove trial details start time related stats boxes [DET-5956] (2944)
  e2319f8d chore: rename download model button (2962)
  7344ce10 feat: add detectron2 example (2918)
  a5d0e7df chore: rewrite primary resource allocation query over public.allocations [DET-5972] (2910)
  f070ab20 chore: add warnings on resource manager exits (2903)
  b0b5427e chore: add documentation for model-hub mmdetection [DET-5924] (2955)
  e03bd4dc fix: fork nested hp [DET-5945] (2953)
  3b1018c0 chore: fix a log message (2945)
  5d11b817 fix: extraneous minio warning while using s3 (2916)
  e64befd7 chore: fix rstrip bug in refresh-ubuntu-amis (2954)
  e0e59123 fix: scroll trial ids with values in trial comparison [DET-5918] (2933)
  560b38ee chore: update docs link in notebook webui modal (2950)
  1295bbd9 feat: add support for nan and infinity metrics [DET-5944] (2943)
  b1e33247 chore: pin mockery version (2949)
  cddff01f fix: make uPlot axis expand to show new data when not zoomed in [DET-5941] (2928)
  5f5b4456 chore: add support for throughput profile chart [DET-5596, DET-5732, DET-5913, DET-5923] (2886)
  5d9918df chore: add 1.17 golang build syntax (2929)
  8bc8b258 fix: correct the logic for hiding log preview for completed trials (2939)
  b2da0721 feat: trial log preview [DET-5882] (2871)
  99940002 fix: kubernetes link with agent user [DET-5907] (2927)
  bb0fec14 fix: e2e nightly model-hub tests (2925)
  9ddb0ec6 fix: make clear forbidden vs. unauthenticated [DET-5869] (2870)
  9ded187f ci: replace make -C tools with devcluster. (2892)
  fb086591 ci: unpin pip version, improve py venv cache key. (2922)
  48ee5a13  feat: Support passing an existing EFS to det deploy aws [DET-5737] (2803)
  c8acf891 fix: change styling on "stop experiment" modal [DET-5837] (2894)
  f5dc7419 docs: update k8s version to 1.19 >= and <= 1.21 (2887)
  8f2a4896 ci: restrict setuptools version. (2920)
  59ca50e1 chore: fix master/agent Docker image vulnerabilities [DET-5926] (2914)
  7d1c9913 feat: add a `make devcluster` target (2900)
  a6e21725 test: fix batch action misclick on e2e tests [2872] (2877)
  dd17c3a8 docs: update idle timeout (2917)

0.17.0.dev0

256a8fd2 docs: add release notes for 0.16.5 (2913)
  97e5e1ff build: remove unneeded build-bindings dependencies (2898)
  ac734f0d feat: support configuring idle timeout for generic commands [DET-4589] (2787)
  190b167f feat: add notebook idle timeout [DET-5517, DET-5519] (2868)
  08cc3100 chore: remove old harness profiler (2901)
  1876c04a ci: `using_k8s` should use `det master config -o json`. (2908)
  81c23f4c fix: pin version of `torchmetrics` for docs builds (2907)
  36bf273c chore: fix docstring syntax and remove extra whitespace (2905)
  c0c5bf85 feat: add sync timing toggle for profiling (2874) [DET-5891]
  76f18b1e chore: update master ClusterRole in helm to permit "list" on "events" (2904)
  292f9428 fix: `det deploy aws --retain-log-group` (2906)
  95c859b2 feat: support mmdetection in model-hub [DET-5471, DET-4558, DET-5609, DET-5610, DET-5474] (2792)
  13fa0356 chore: limit overviewstats info height to one line (2899)
  4646dfe3 fix: include maxval in int hparam range (2884)
  6dc1422a feat: master yaml templates for `det deploy aws|gcp` [DET-5766] (2766)
  7823b36e fix: enable experiment controls in the header for single trial (2902)
  fc3b83e8 feat: push architecture (webui side only) (2855)
  eea38c6f feat: push architecture (python side only) (2771)
  00dbd73f feat: push architecture (master side only) (2776)
  5322e9d7 fix: remove rerender on row selection on Experiment List page (2897)
  
  
  
  Docker images
  
  - `docker pull determinedai/determined-master:latest`
  - `docker pull determinedai/determined-master:0.17.0`
  - `docker pull determinedai/determined-master:7e6721ba`
  - `docker pull determinedai/determined-master:7e6721ba8ed9c0ca2d182633982b0a091a6f6d26`
  - `docker pull determinedai/determined-dev:determined-master-7e6721ba`
  - `docker pull determinedai/determined-dev:determined-master-7e6721ba8ed9c0ca2d182633982b0a091a6f6d26`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:0.17.0`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:7e6721ba`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:7e6721ba8ed9c0ca2d182633982b0a091a6f6d26`

0.16.5

af5bd3e4 docs: add release notes for 0.16.5 (2913)

0.16.5rc2

a24a14be fix: pin version of `torchmetrics` for docs builds (2907)
  c161bcec chore: update master ClusterRole in helm to permit "list" on "events" (2904)
  0e5900f9 feat: add sync timing toggle for profiling (2874) [DET-5891]

0.16.5rc1

8e4b5d0f fix: enable experiment controls in the header for single trial (2902)

0.16.5rc0

325b29bc chore: lock api state for backward compatibility check
  863a2d11 feat: expose primitives for pytorch dataloaders (1937)
  beff21b5 chore: augment fields displayed by describe checkpoint (2889)
  4d08b011 fix: make k8s watchers more resilient [DET-5910] (2880)
  6e4efcc3 chore: bump gke patch version (2890)
  164560dd ci: push `latest` tag for master and agent Docker images (2891)
  787522e1 fix: relax imagenet ci target (2883)
  1f6c8cbf feat: add `det agent disable --drain` [DET-5713] (2827)
  70a6f7fb fix: profiler metrics without follow should return metrics [DET-5911] (2879)
  053283a2 build: sunset circleci based react preview (2881)
  920885ca docs: update copyright date (2885)
  ed4bedfa chore: users can delete their own exps [DET-5901] (2878)
  489b1c62 docs: use virtual environments [DET-5361] (2862)
  f4563995 docs: reorganize documentation (2861)
  4ab688c7 fix: load pre-0.13.8 checkpoints properly (2876)
  1c997dab refactor: ban python builtin shadowing. (2875)
  14012f43 fix: don't validate entire expconf on preview-search (2873)
  c8c151b0 fix: make entrypoint startup-hook.sh eval consistent [DET-5874] (2847)

0.16.5.dev0

5bab3170 docs: add release notes for 0.16.4 (2865)
  dca931e7 chore: make MetricsBatcherThread safer (2864)
  ad1eb63d fix: uPlot to show values of 0 (2863)
  1d94e50b fix: remove bad switch default (2859)
  9e26e993 fix: dont force nvidia runtime for users using Docker native GPU support (2854)
  174e9949 fix: remove visual gap on trial comparison (2857)
  906a76cb refactor: apply use settings on other sections (2849)
  f2e4b919 fix: fix row selection and errors on experiement visualization (2856)
  3d898190 fix: test credentials for test_tf_keras_mnist_data_layer_ (2853)
  1bd92e7a style: adjust styles to render exp config and logs to render properly on mobile (2851)
  d0284bbd fix: switch container runtime on slot type (2845)
  03cd749b docs: describe how to set task priorities (2850)
  d60f34ec fix: reduce imagenet ci time (2848)
  4eb5c3b4 feat: collect sync_optimizer and backwards pass timings [DET-5724] (2820)
  
  
  
  Docker images
  
  - `docker pull determinedai/determined-master:latest`
  - `docker pull determinedai/determined-master:0.16.5`
  - `docker pull determinedai/determined-master:106b3528`
  - `docker pull determinedai/determined-master:106b352802563243c49c52e0a9972e5b04257a25`
  - `docker pull determinedai/determined-dev:determined-master-106b3528`
  - `docker pull determinedai/determined-dev:determined-master-106b352802563243c49c52e0a9972e5b04257a25`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:0.16.5`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:106b3528`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:106b352802563243c49c52e0a9972e5b04257a25`

0.16.4

69817c42 docs: add release notes for 0.16.4 (2865)
  f87495fa docs: describe how to set task priorities (2850)

0.16.4rc0 not secure

5b6f3180 chore: lock api state for backward compatibility check

0.16.4.dev0

1b95dcd4 docs: add release notes for 0.16.3 (2774)
  b7c88380 feat: support tf 2.6 in TFKerasTrial, update env images. (2839)
  05ddec47 chore: remove some unnecessary conversions and error paths (2840)
  acd45055 fix: support pre-cross_rank versions of horovod (2841)
  e3d76b22 fix: casts non-number HPs to strings (2837)
  892fd66e refactor: responsive table batch [DET-5848] (2836)
  c12cd203 feat: adding profiling metrics to continuous benchmarking (2796)
  b4e0a1af feat: add the ability to set job priorities on the fly [DET-5863] (2834)
  5ca1aacd docs: improve Notebook docs (2811)
  94f7b35b fix: reformat porting tutorial (2833)
  1102ced1 docs: add porting guide (2624)
  6d69c532 feat: add imagenet pytorch example (2623)
  05a488b4 fix: add in 'just a snapshot' msg to not lose progress on restart (2830)
  cd98af67 chore: clean up priority scheduler code a bit (2831)
  f6c41cf7 feat: add links to trial pages in trial comparison modal [DET-5850] (2817)
  9cf63894 style: slight style tweaks for trial comparison (2829)
  304b25c4 feat: allows selection and unselection of hps and metrics in trial comparison table [DET-5854] (2826)
  cbd74085 feat: add ability to compare Trials from Learning Curve and HP Parallel Coords charts [DET-5851] (2822)
  aa43d87a refactor: settings data flow [DET-5625] (2786)
  b937a7bc fix: remove unnecessary rows from trial comparison [DET-5857] (2816)
  b7596b76 feat: add Trial ID to header of single-trial experiment page [DET-5816] (2821)
  9648ac13 fix: show tabs independent of trial detail loading [DET-5839] (2823)
  305f138a refactor: allow user to inline edit experiment name [DET-4405] (2674)
  26d395ee fix: properly dedup container configs (2825)
  1acad83e fix: nail down installed swag cmd version. (2824)
  ad634874 fix: issue with install and version check of tensorboard (2819)
  6615a3fa fix: don't display indefinite spinner if experiment is paused [DET-5849] (2813)
  081b329c fix: size and resize the config monico editor [DET-5736] (2802)
  21075271 chore: upgrade typescript target to es6 (2812)
  24492cfe fix: minor trial comparison layout adjustments [DET-5852] [DET-5853] (2814)
  579685b2 chore: basic python 3.9 support. (2808)
  b4a58722 chore: don't silently drop agent ws failures (2755)
  706b30b6 fix: correct timing metric chart y-axis label to Seconds (2810)
  46bb3efe chore: increase gRPC recv cap to accomodate equal size shell and experiment context dirs (2807)
  02e3210a update gke version (2806)
  4bfebadc feat: add experiment deletion support to the webui [2752] (2775)
  3486a499 feat: switch to bash in jupyterlab shells. [DET-5791] (2804)
  8112a7e4 feat: move workloads back to trial overview page [DET-5738] (2799)
  f08ab048 fix: preserve zoom levels between uPlot remounts [DET-5636, DET-5751] (2797)
  9297c842 ci: add label for GKE clusters in CI (2800)
  61886e1e feat: add trial comparison modal [DET-5417] (2794)
  40292507 docs: fix formatting lint failure (2801)
  55d9c6f2 fix: remove track for constant hps [DET-5815] (2798)
  d9418557 refactor: fork and continue trial [DET-5817] (2765)
  fd7cf0d7 docs: add manual aws modification steps [2716] (2749)
  8c452512 feat: change k8s preemption scheduler to backfilling scheduler [DET-5398] (2795)
  89315931 test: improve web e2e test stability [2750] (2777)
  7769d2e9 ci: add linter for secrets (2791)
  f10108fa docs: add git-secrets docs [DET-5830] (2790)
  673cf1df fix: prevent log html injection via unicode [DET-5826] (2789)
  81258edb chore: fix release note lint (2788)
  18a7f5cf chore: remove stale install extras from harness python package. (2767)
  23d6a8a4 fix: only run circleCI step on master branch (2782)
  29cddc03 feat: add `--config` override to `det e create`. [DET-5786] (2769)
  e28e871e chore: update rstfmt line length to 100, and reformat all docs. (2768)
  48c5a306 fix: fix indefinite spinner for terminal single-trial experiments (2772)
  28a106c4 feat: persist checkbox selection when changing page if results are paginated [DET-5416] (2756)
  52c93a50 fix: allow full-sized model definitions through grpc (2762)
  6c913f1b fix: add reload route to allow remounting of same pages [DET-5818] (2761)
  f56255e4 feat: integrate test results from CircleCI jobs with persistent benchmarking (2737)
  95db1fe4 fix: fix stale trial data caused by internal react re-route (2763)
  a42b6ad9 fix: POST /api/v1/experiments/:id/cancel should cancel not kill trials (2759)
  73d5d31e fix: fix wait messaging around undefined trial id (2757)
  f3d6a82d chore: fix log type hyperparameters (2758)
  8db5e260 fix: pull model definitions into containers [DET-5788] (2753)
  866670b7 feat: async DELETE experiment [DET-5804] (2741)
  
  
  
  Docker images
  
  - `docker pull determinedai/determined-master:0.16.4`
  - `docker pull determinedai/determined-master:88e26e66`
  - `docker pull determinedai/determined-master:88e26e66f3da10cb2867bba9e3d3883e51af6c8a`
  - `docker pull determinedai/determined-dev:determined-master-88e26e66`
  - `docker pull determinedai/determined-dev:determined-master-88e26e66f3da10cb2867bba9e3d3883e51af6c8a`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:0.16.4`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:88e26e66`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:88e26e66f3da10cb2867bba9e3d3883e51af6c8a`

0.16.3 not secure

ffff0f46 docs: add release notes for 0.16.3 (2774)

0.16.3rc2 not secure

a027c813 fix: allow full-sized model definitions through grpc (2762)
  bfa40d69 fix: add reload route to allow remounting of same pages [DET-5818] (2761)
  28b41218 fix: fix stale trial data caused by internal react re-route (2763)
  e7b37d4c fix: POST /api/v1/experiments/:id/cancel should cancel not kill trials (2759)
  c8937772 fix: fix wait messaging around undefined trial id (2757)

0.16.3rc1 not secure

e226dad5 chore: fix log type hyperparameters (2758)
  7e38c1bc fix: pull model definitions into containers [DET-5788] (2753)

0.16.3rc0 not secure

4ce8500b chore: lock api state for backward compatibility check
  d411174f feat: add AKS support [DET-5464] (2524)
  ca44e980 fix: k8s available GPUs indicator [DET-5808] (2754)
  6da63551 feat: add nested hp support [DET-4786] (2699)
  8c5b689a test: set explicit agent slot_type for e2e-tests local and ci env (2747)
  e63e5223 fix: correct single trial experiment routing [DET-5789] (2748)
  b4ab5b1a chore: update the jackc/pgx version for ResetSession hook (2729) [DET-5018]
  c0a6d4e9 fix: hp search configs for darts cnn example (2725)
  9df9099d feat: add filtering by state to trials page [DET-5730] (2732)
  db7437c7 fix: false positive scaler_state_dict warning (2745)
  bdf86bfd refactor: improve fork error message [DET-5795] (2735)
  683fbc65 chore: replace node sass [DET-5806] (2743)
  ff732a15 chore: fix rendezvous address port parsing (2744)
  8cb35401 chore: re-enable automatic updates to the preview cluster [2680] (2734)
  ab3f2bf7 chore: add patch to webui proxy (2742)
  22d67ff5 refactor: clean up task spec 2 (2698)
  a901aa19 chore: Add back Continue Trial button to single-trial experiment page [DET-5749] (2731)
  3deb93f3 feat: check server reachability on first load [2739] (2721)
  11a46089 feat: add view logs action to experiment trials [2714] (2723)
  2e6182eb docs: fix some monospace treatment. (2712)
  6a97d46d test: bump test_streaming_observability_metrics_apis timeout (2738)
  7bee7521 fix: always terminate streams for terminated trials [DET-5790] (2728)
  d7fd221a fix: restrict google-cloud-storage dependency. (2736)
  90b87000 fix: destroy hp-viz tabs when navigating away [DET-5794] (2730)
  a77e3b9c chore: temporarily disable automatic updates to the preview cluster (2733)
  40a24513 feat: add synchronized query paramaters to several pages [DET-5301] (2711)
  4ea7a1cb feat: add descriptive messages to various loading spinners [2719] (2718)
  fa90df6d feat: enable sorting trials by state [2673] (2722)
  06628d28 chore: fix a missing loading indicator for task list (2720)
  e84ad9f0 docs: fixing typo (2710)
  5a825ad2 chore: fix scientific notation in example yamls (2688)
  953af22f fix: bug in eval for dtrain for question-answering example [DET-5756][DET-5757] (2707)
  32b30505 chore: update model-hub transformers base image [DET-5701] (2614)
  1840ea86 fix: Add legend to Trial metrics graph [DET-5723] (2663)
  ebf0e92c refactor: show "no data" instead of spinner when multi-trial is not available (2713)
  a8e3d8db fix: cpu only preemption [DET-5763] (2717)
  5f4db2de chore: replace proteins_pytorch_geometric example with a better one. (2706)

0.16.3.dev0

67ade3c5 docs: add release notes for 0.16.2 (2709)
  c0f55289 docs: EKS Auto-scaling fix [DET-5728] (2677)
  dae469fa fix: immediately fetch single trial data when able to [DET-5758] (2708)
  41b73c39 chore: audit and update react dependencies (2650)
  5aad1bcd fix: fix an issue with rendering boolean hp values in learning curve table [2670] (2694)
  65f7ab0b chore: improve dynamic server address support (2690)
  3ce4b517 docs: warn users not to use cpu training with a custom scheduler (2703)
  5f4ff293 revert: "fix: prevents k8s priority scheduler from blocking cpu only training (2631)" (2702)
  4a52b926 fix: remove proteins_pytorch_geometric from nightlies. (2701)
  d2c1f351 chore: regenerate certificates for multimaster test. (2704)
  3a8a2bc0 fix: e2e test (2700)
  93e92a91 build: let netlify handle node dependency management (2693)
  d39adbc9 fix: no data available [DET-5734] (2695)
  488fe333 chore: fix error introduced by rebasing (2697)
  df183b9a feat: allow nightly gpu tests to be requested (2692)
  115e357d chore: clean up task spec (2662)
  c3ef821a fix: preserve subroutes (2676)
  3f99b6ca Revert "remove slack user mentions"
  f81a1b43 remove slack user mentions
  273fc78c chore: move automount path into /run/determined/workdir (2687)
  517b7b14 ci: add preview cluster creation to Circle CI (2681)
  bb4b1cc3 fix: visual bug in hp plot [DET-5720] (2689)
  4bee18b9 fix: e2e nightly tests affected by environment upgrades (2683)
  80a56b5d fix: return tensorboard config with GET /api/v1/tensorboards/<id> (2685)
  c9c1a7f7 fix: avoid guessing whether loginRedirect is an internal route  [2686] (2684)
  759c9aab feat: Add Efficientdet example (1733)
  e5e06959 fix: correctly interpret minval and maxval for log hps (2682)
  037ead62 build: enable and add Netlify config [2629] (2651)
  b1d74327 chore: lint det deploy (2679)
  b9340d3d chore: fix a flag in det deploy gcp (2678)
  a7f3da6c feat: improve det deploy [DET-5684] (2675)
  
  
  
  Docker images
  
  - `docker pull determinedai/determined-master:0.16.3`
  - `docker pull determinedai/determined-master:abc20a36`
  - `docker pull determinedai/determined-master:abc20a36d08929681fca9e64710ef1189bdbff15`
  - `docker pull determinedai/determined-dev:determined-master-abc20a36`
  - `docker pull determinedai/determined-dev:determined-master-abc20a36d08929681fca9e64710ef1189bdbff15`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:0.16.3`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:abc20a36`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:abc20a36d08929681fca9e64710ef1189bdbff15`

0.16.2 not secure

b7b153d8 docs: add release notes for 0.16.2 (2709)

0.16.2rc6 not secure

7267a6cd fix: immediately fetch single trial data when able to [DET-5758] (2708)

0.16.2rc5 not secure

27cfbc5a fix: remove proteins_pytorch_geometric from nightlies. (2701)
  c9080630 fix: fix an issue with rendering boolean hp values in learning curve table [2670] (2694)
  cfc2d9a7 fix: fixes commit 9fec2f9e14d that cherry picked a revert (2705)

0.16.2rc4 not secure

ffe8f444 chore: regenerate certificates for multimaster test. (2704)
  dd67fe70 docs: warn users not to use cpu training with a custom scheduler (2703)
  9fec2f9e revert: "fix: prevents k8s priority scheduler from blocking cpu only training (2631)" (2702)
  8cf08359 fix: no data available [DET-5734] (2695)

0.16.2rc3 not secure

e226c699 fix: visual bug in hp plot [DET-5720] (2689)
  70a822f7 chore: move automount path into /run/determined/workdir (2687)
  af7970a3 fix: e2e nightly tests affected by environment upgrades (2683)
  702328c2 chore: lint det deploy (2679)
  97e4e600 chore: fix a flag in det deploy gcp (2678)
  6d4397ea feat: improve det deploy [DET-5684] (2675)

0.16.2rc2 not secure

fd8b8135 fix: return tensorboard config with GET /api/v1/tensorboards/<id> (2685)
  d7627b0e fix: avoid guessing whether loginRedirect is an internal route  [2686] (2684)
  b94d9d28 fix: correctly interpret minval and maxval for log hps (2682)
  7977c2c0 chore: fix a flag in det deploy gcp (2678)
  f2c37dba feat: improve det deploy [DET-5684] (2675)

0.16.2rc1 not secure


        

0.16.2rc0

09602aa4 chore: lock api state for backward compatibility check
  8f30870f fix: cli auth storage updates on Windows. (2671)
  9aa67587 chore: update images for release (2672)
  e826bfee feat: improve support for `pytorch_geometric` and custom pytorch batches (2644)
  fc008944 fix/docs: support det-deploy local cluster-up specified directory to bind mount [DET-5431] (2668)
  ae91f6fd install tensorboard if not already installed (2633)
  6c65dbab fix: pull registry_auth from experiment for tensorboard (2616)
  dc129166 chore: update experiment tag list limits (2655)
  cd0898a4 chore: simplify resourcetype related states following its removal (2658)
  c8fc329e fix: fix an issue with SPA routing affecting model download [2648] (2661)
  b9501a8a fix: kube RPs should allow aux tasks [DET-5710] (2652)
  eb57a0e3 fix: correct for possibility of negative numbers in log range [DET-5717] (2666)
  9996f426 fix: Add page elements to Trial page while data is loading [DET-5718] (2665)
  b969425f fix: Configuration height doesn't go to the full page height [DET-5721] (2664)
  c31f7f43 feat: support auto bind mount as part of det deploy local [DET-5432] (2610)
  c290ab25 fix: improve trial chart tooltip [DET-5712] (2653)
  4f783363 fix: header progress bar [DET-5709] (2645)
  31c87b42 feat: change hyperparameter tab on multi-trial experiment trial pages [DET-5413] (2642)
  cbf6a636 chore: add a readme for react/scripts (2641)
  6bfade11 build: remove java dependency for building webui [2591] (2581)
  7397a9d0 chore: make experiment description editable via webui [DET-4398] (2634)
  2e4c995d feat: make `det deploy gcp` clusters log to GCP Cloud Logging (2639)
  98d7a3ad chore: update webui experiment layout for multi-trial and single-trial [DET-5407] (2595)
  15f0f0a7 fix: make "Continue Trial" button work again [DET-5704] (2636)

0.16.2.dev0

40df0106 docs: add release notes for 0.16.1 (2632)
  51154955 fix: prevents k8s priority scheduler from blocking cpu only training (2631)
  64543315 fix:nightly tests broken after environment upgrades (2628)
  b21a57f6 chore: update ZMQ logic in DistributedContext (2593)
  1f945af8 feat: add rendezvous API (2420) [DET-5428]
  10b3a5ce fix: prevent impossible slot requests for notebooks [DET-5690] (2625)
  4b3278c0 chore: improve error handling for react preview proxy [gh-2622] (2621)
  4f0fa2c4 chore: Update environments to have a minimal base layer (2627)
  a1b5e4a5 docs: correct pod spec (2618)
  d612a97c fix: improve det deploy aws messaging for inconsistent stack states [DET-5695] (2617)
  0bd51656 fix: make the gRPC gateway never use a proxy [DET-5689] (2620)
  0b3920e3 feat: show a loading state while fetching profiler metrics [gh-2605] (2606)
  5f98e0b7 chore: add tooling to document echo-based apis (2529)
  94246b40 check go mod tidy causes no changes for /proto (2619)
  b09ed813 feat: add API to query an experiments best searcher validation (2422) [DET-5212]
  6acf0d11 chore: update default environment images before release (2615)
  2ab89edc fix: update custom-env docs with new versions (2611)
  d3b55edb fix: linting error in circle ci (2613)
  252351ab feat: display active table filter count and option to reset [gh-2584] (2603)
  7d094de6 fix: print out errors when agent setup script generating fails (2612)
  27e321f6 fix: avoid presenting gpu uuids in profiler if there are no uuids [gh-2604] (2608)
  09d18c09 fix: helm chart version in Makefile (2609)
  
  
  
  Docker images
  
  - `docker pull determinedai/determined-master:0.16.2`
  - `docker pull determinedai/determined-master:2eefec98`
  - `docker pull determinedai/determined-master:2eefec98a0a49856c4a44a91fc5031323d1e04ca`
  - `docker pull determinedai/determined-dev:determined-master-2eefec98`
  - `docker pull determinedai/determined-dev:determined-master-2eefec98a0a49856c4a44a91fc5031323d1e04ca`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:0.16.2`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:2eefec98`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:2eefec98a0a49856c4a44a91fc5031323d1e04ca`

0.16.1 not secure

a160523b docs: add release notes for 0.16.1 (2632)

0.16.1rc4 not secure

058290e1 feat: display active table filter count and option to reset [gh-2584] (2603)
  7e52ba10 fix: make the gRPC gateway never use a proxy [DET-5689] (2620)

0.16.1rc3 not secure

a028858a chore: update default environment images before release (2615)

0.16.1rc2 not secure

aa70a3aa fix: print out errors when agent setup script generating fails (2612)

0.16.1rc1 not secure

53c11d07 fix: helm chart version in Makefile (2609)

0.16.1rc0 not secure

Docker images
  
  - `docker pull determinedai/determined-master:0.16.1`
  - `docker pull determinedai/determined-master:de111d2e`
  - `docker pull determinedai/determined-master:de111d2e27edee2e6cabcdda2ba01757314caf5e`
  - `docker pull determinedai/determined-dev:determined-master-de111d2e`
  - `docker pull determinedai/determined-dev:determined-master-de111d2e27edee2e6cabcdda2ba01757314caf5e`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:0.16.1`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:de111d2e`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:de111d2e27edee2e6cabcdda2ba01757314caf5e`

0.16.0 not secure

091e039e docs: add release notes for 0.16.0 (2575)

0.16.0rc4 not secure

be47a79c docs: update the JupyterLab bump release note (2567)
  e5327617 fix: don't return dupes from det model list-versions (2564) [DET-5640, DET-4248]

0.16.0rc3 not secure

438b1129 perf: optimizations to query batching fetch profiler metrics [DET-5637] (2559)

0.16.0rc2 not secure

ffe65cd6 fix: Change wording on modals that edit configs. (2562)
  89649c7e fix: set elastic ip domain to vpc in det deploy aws (2557)
  e48cd1df fix: dedup BindMounts and Devices on merge (2560)
  9938be7d fix: use model instead of schema struct for de-duping (2545)
  e0c8decc docs: extend docs for the client module (2556)
  2687f4d5 docs: add python sdk docs (2547)
  5977ce02 chore: also set cli_cert in dtrain worker processes (2555)

0.16.0rc1 not secure

62e99c07 chore: fix typos (2554)
  27366005 chore: rename profiler tab in webui (2551)
  4cee9fa2 fix: Incorrect help link when profiles aren't enabled for a trial. [DET-5621] (2549)
  d623f5f4 chore: rename start_on_batch to begin_on_batch everywhere (2553)
  61be9559 chore: revamp experiment and trial pages header [DET-5406] (2456)
  ee72cdce fix: add bumpenvs for tf-2.5 images. (2552)

0.16.0rc0 not secure

b910703f chore: lock api state for backward compatibility check

0.16.0.dev0

d5145feb docs: Release notes for 0.15.6. (2493)
  068bb33f fix: prevent zoom reset if chart is already zoomed [DET-5514] (2525)
  3f44c83f fix: stop parsing notebook config on every edit [DET-5605] (2528)
  03b28bef chore: fix client for new password handling (2546)
  fe05b0b0 chore: avoid defaulting to filter by current user [DET-5602] (2540)
  1e945afb feat: expose a default Determined in det.experimental.client (2532)
  76230f83 chore: remove swagger-generated python code (2541)
  c7ac21d6 fix: password handling in python sdk. (2543)
  56dd19d4 feat: pull tensorboard images from experiment configs (2544)
  48ceaf2a fix: fix hparam string representation failure [DET-5616] (2539)
  8dfa0888 feat: pull tensorboard images from experiment configs (2534)
  0ebeba33 chore: fix dropped cert argument in Authentication (2542)
  d0adc51c feat: multimaster Authentication objects [DET-5308] (2531)
  f1c9b1f4 feat: bump JupyterLab to 3.0.16 [DET-4872] (2526)
  12a8cae4 chore: bump default environment CPU and GPU images to tf-2.4 (2523)
  caf61c97 docs: add release notes for profiling features [DET-5351] (2535)
  deb4cbf4 chore: initialize cli_cert in e2e tests (2530)
  81eefc75 chore: bump transformers version for model-hub (2522)
  e9f5947e fix: add init_invalid_hp to master [DET-5569] (2478)
  ccdcaa8b chore: allow non-singleton Authentication (2513)
  0a887e9d fix: trial profiling system metric chart ignoring zero [DET-5505] (2515)
  0d9a5401 fix: allow bumpenvs to update nvcr images in helm charts (2520)
  ec89928b feat: provide tensorflow 2.5 image [DET-5522] (2517)
  55c3353f docs: recommend users upgrade to 0.16.0 to avoid k8s master crashes (2518)
  a2f6fc26 chore: improved pynvml usage by profiler [DET-5394] (2487)
  a06d3a2e chore: minor edits to cli behaviors (2519)
  23160572 fix: add back bindmounts entry to command's default config (2521)
  3d34e1c9 fix: notebook modal improvements [DET-5599] (2511)
  6db82632 feat: add experiment notes & name [DET-5352] (2307)
  17976404 chore: update urllib3 (2504)
  49aec0db feat: support back-filling in the priority scheduler [DET-5397] (2436)
  aec1074b chore: handle error when loading notebook config (2512)
  09fca004 feat: add bind mounts to task container defaults [DET-5362] (2516)
  4068fde9 chore: collect prometheus metrics (2501)
  ed896c77 fix: python api create experiment bug (2510)
  0c9ec27e fix: avoid rc dev release mismatch notifications (2405)
  4fd33262 chore: task list filters [DET-5390] (2466)
  1f49553e test: add e2e tests for profiling features [DET-5245] (2481)
  ec9932df chore: upgrade ws to patch security vulnerability (2505)
  3857f945 chore: add experiment name to breadcrumb on trial detail page [DET-5284] (2318)
  87b1e598 docs: add release note for printable config (2507)
  212aa936 chore: disable profiling after restart [DET-5424] (2486)
  24432fe1 docs: add profiling how-to [DET-5209] (2384)
  2b04bf0c chore: fix TrialsSnapshotResponse comment typo (2492)
  1ca42b44 chore: fix TF version detection and RNG usage in test (2500)
  106294a6 chore: migrate away from spot checks and move towards waiting for an expected case (2495)
  eecc4461 fix: generating printable master config does not alter original (2502)
  bf9b3ac1 fix: observability webui fixes [DET-5567][DET-5246][DET-5506][DET-5531][DET-5530][DET-5571] (2488)
  5b73278f chore: improve profiler throughput collectors (2490)
  55b122ea chore: remove native init() functions [DET-5574] (2480)
  6ac0268d chore: add testing for `eventually` schema [DET-5560] (2467)
  6f86594e chore: remove trial old messages and consolidate others (2464)
  bae9c2d5 chore: fix some semi-broken unit tests (2483)
  3f9f2daa fix: ship gpu_free_memory correctly [DET-5508] (2497)
  0dae8015 chore: add non-streaming APIs for trial profiler endpoints (2484)
  0b0e9ca8 chore: update eslint-no-unused-vars to handle special cases (2496)
  d81f8ade fix: notebook modal bugs [DET-5573] (2476)
  8ee598d8 chore: improve performance of tfevent file filtering (2469)
  341fb4fa chore: trim unused parts of rendezvous info (2381)
  ba07a04e chore: promote profiler APIs out of unimplemented (2485) [DET-5587]
  3f532898 fix: send all batches from harness profiler [DET-5566] (2473)
  c5201873 chore: deprecate det.experimental.create_trial_instance() (2479)
  b0f57d69 fix: ProfilingAgent serializing timestamps incorrectly (2482)
  6a673830 fix: propagate slots when it is 0 (2477)
  4b97010a chore: measure profiler timings with time.time() (2475)
  2e38dfab chore: reword README for schemas (2474)
  3d6e73db fix: show x axis label on all plots [DET-5500] (2471)
  2e83f225 fix: make tf estimator dtrain work with tf 2.5 [DET-5563, DET-3762] (2468)
  f893eeef fix: timing metric chart x-axis tick off [DET-5501] (2472)
  aa8d4427 chore: log running of migrations (2463)
  36139a1d docs: add instructions to use dtrain workflow for inference with PyTorch (2386)
  66c64521 feat: hook ProfilerAgent into harness and add profiler timings [DET-5062, DET-5204] (2348)
  c52c6165 chore: move run increment to allocation not termination [DET-5559, DET-5450] (2462)
  feac8cf7 feat: add launch notebook modal [DET-5376] [DET-5377] [DET-5380] [DET-5378] [DET-5379] [DET-5375] (2398)
  7c178564 chore: catch ruamel.yaml Duplicate Key Errors and format for users [DET-5542] (2450)
  2584c5b8 chore: rem to px [DET-5327] (2433)
  ddf8693d fix: allow custom registries with determined env images [DET-5556] (2465)
  8c1d0a99 fix: cleanup iter(DataLoader) before exiting [DET-5558] [DET-5554] (2459)
  2c3bfa38 fix: use user preferences when no search params are present (2460)
  80f4375b chore: disable dashboard recent tasks tests temporarily (2461)
  7f1c61d1 feat: `det deploy --image-repo-prefix` for pulling images from a custom docker repo (2454)
  a5170400 fix: synchronize pods actor startup in k8s resource manager [DET-5536] (2453)
  ea4566f1 fix: update Buf image and CLI usage (2455)
  80920725 chore: bump buf and protoc version [DET-5534] (2446)
  92bf2c63 fix: prevent concurrent updates to a single expconf object [DET-5543] (2451)
  ea66301a revert added example model (tf classification) (2452)
  71a35025 fix: prevent spot resource pool contention [DET-5349] (2423)
  8def1560 cli: small rewording in shell help (2448)
  bac39242 ci: regen buf image with buf 0.12.1 (2447) [DET-5534]
  193ac654 docs: fix broken links (2439)
  da7fe34c fix: introduce LegacyConfig for tensorboard and checkpoint gc [DET-5533] (2444)
  a9f0fe87 fix: omit internal fields in previewed notebook [DET-5523] (2434)
  a690381a fix: allow EOL searchers in configs only [DET-5526] (2445)
  
  
  
  Docker images
  
  - `docker pull determinedai/determined-master:0.16.0`
  - `docker pull determinedai/determined-master:f5a590b8`
  - `docker pull determinedai/determined-master:f5a590b8e8b0f589f8086111c93a42f92760041c`
  - `docker pull determinedai/determined-dev:determined-master-f5a590b8`
  - `docker pull determinedai/determined-dev:determined-master-f5a590b8e8b0f589f8086111c93a42f92760041c`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:0.16.0`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:f5a590b8`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:f5a590b8e8b0f589f8086111c93a42f92760041c`

0.15.6 not secure

25450846 docs: Release notes for 0.15.6. (2493)

0.15.6rc3 not secure

bfad801e chore: move run increment to allocation not termination [DET-5559, DET-5450] (2462)
  fa03ce85 ci: regen buf image with buf 0.12.1 (2447) [DET-5534]

0.15.6rc2 not secure

6546b65d chore: catch ruamel.yaml Duplicate Key Errors and format for users [DET-5542] (2450)
  4e3ade04 fix: allow custom registries with determined env images [DET-5556] (2465)
  730daeb7 fix: cleanup iter(DataLoader) before exiting [DET-5558] [DET-5554] (2459)
  d891b905 fix: synchronize pods actor startup in k8s resource manager [DET-5536] (2453)
  c2f08d23 fix: use user preferences when no search params are present (2460)
  ae8d54e0 revert added example model (tf classification) (2452)
  b898f119 fix: prevent spot resource pool contention [DET-5349] (2423)
  a7ce160a docs: fix broken links (2439)
  14300c4b cli: small rewording in shell help (2448)

0.15.6rc1 not secure

026a929d fix: allow EOL searchers in configs only [DET-5526] (2445)
  71166d16 fix: introduce LegacyConfig for tensorboard and checkpoint gc [DET-5533] (2444)
  ecf0b66b fix: omit internal fields in previewed notebook [DET-5523] (2434)

0.15.6rc0 not secure

016c33d9 chore: lock api state for backward compatibility check
  cd5c9399 fix: webui observability show chart only if metrics are available [DET-5418] (2424)
  6f677995 docs: notify users of coscheduler behavior [DET-5150] (2442)
  861c19a6 fix: resource pool not saved in the DB [DET-5485] (2435)
  343d8106 chore: whitelist `eventually` from schema linter
  25f6f3a8 feat: add `eventually` extension to schema [DET-5520] (2432)
  2f9dcff9 chore: Prevent _swagger from being formatted by make -C harness fmt (2440)
  46437483 docs: update procedure for latest NVIDIA drivers on GKE (2429)
  4f2a16fe chore: minor copy fix for alert box spaces (2438)
  6e255d0b fix: make profiling schema more lenient [DET-5497] (2409)
  8c965419 chore: update OS and other language in Terraform modules [DET-4276] (2415)
  5c4b5026 chore: reduce minimum char to fuzzy match for omnibar (2430)
  fe71847b fix: correct lint issues (2437)
  c7dcd414 chore: more eslint rules [DET-5513] (2426)
  d272e891 fix: observability webui widen dropdowns so the entire string is readable [DET-5503] (2425)
  55ff22e2 fix: fix convergence and distributed tests for tensorflow example (2431)
  c055571e docs: using `det shell` as a remote shell in IDEs. (2428)
  c71ff13f feat: omnibar initial support [DET-5374] (2308)
  57504fe0 chore: update default images (2427)
  4f7acabd fix: merge logic for union-type configs [DET-5486] (2410)
  366b82b2 feat: `det shell` option to show ssh command for use in IDE [DET-5462] (2407)
  98b5a1ef chore: table head style update (2419)
  e938ad19 chore: terminate /api/v1/trials/:id/avialable_series on trial termination (2418) [DET-5499]
  e3f4fb91 fix: improve tqdm rendering in the web ui (2320)
  2bf85348 Disable tests that will never pass on mac os x (2417)
  077bec89 docs: resource pool fixes (2408)
  8da96e99 fixed typo in custom custom docker configuration (2413)
  d7aa85f6 docs: update create_experiment (2416)
  0719c1d6 fix: fix an issue with parsing old exp config labels [DET-5487] (2411)
  6b260e72 chore: only return the port binding appropriate for the proxy [DET-5495] (2401)
  f9e099e0 docs: update parameter string (2412)
  d075392b feat: python-sdk [DET-5371] (2317)
  74b4e250 feat: add new multiclass text classification example for tensorflow [DET-5277] (2396)
  8e41491f feat: support more types of CPU instances on AWS [DET-4939] (1907)
  58b7e02a feat: experiment list search [DET-5460] (2392)
  66158f0c chore: update trial page overview layout [DET-5411] (2389)
  1d7f0494 chore: upgrade timeago-react for react v17 (2404)
  ca4ab7ad fix: correct resource pool pagination and make sort sticky [DET-5482] (2403)
  c0e23f46 chore: add zmq-based IPC to the DistributedContext (2373)
  3a625b4d chore: make pip happy again (2399)
  3b08403f fix: add max size limit metrics [DET-4878] [DET-4783] (2387)
  212dd090 chore: remove upstreamed gradient aggregation test (2406)
  8287dbe2 fix: correct the url search param setting for archived (2402)
  41aea035 docs: Release notes for 0.15.5. (2397)

0.15.6.dev0

950a5911 docs: deprecate old master configuration fields (2395)
  c0821a9f fix: wire up support for plain-string image config (2393)
  09e1dd83 ci: temporarily remove flaky tests (2394)
  ea3f9321 chore: Edit docs for typos (2391)
  99188264 feat: support push metric APIs internally [DET-5215] (2315)
  7b62b334 chore: widen the trial link on experiment detail page [DET-5459] (2390)
  c8dd9fd8 feat: add SlotsPerAgent in resource pool API (2383)
  993a5268 fix: Fix nightly gpu tests for pytorch word language model [DET-5226] (2388)
  fb60a581 chore: move trial logs in a trial detail page tab [DET-5410] (2365)
  0f3ba700 refactor: experiment list native filters [DET-5389] (2378)
  7b58a2d2 chore: move Trial Information table in a dedicated Trial page tab [DET-5434 (2372)
  e1e4b9e0 feat: Add PyTorch Word language Modeling example to Determined's Example [DET-5226] (2352)
  9e20c3c0 chore: unrevert and fix "actually use expconf in the master" (2382)
  8560a969 chore: remove unused protobuf imports (2336)
  f41e788f chore: update gke version (2385)
  1b134e35 chore: simplify tensorboard request msg (2377)
  30ab1463 chore: close agents on websocket closures (2380)
  7d1509b8 docs: spelling fixes in model hub (2379)
  d0c66510 fix: `det deploy gcp` support for terraform 0.15 [DET-5449] (2376)
  cde5700c chore: revert "actually use expconf in the master" (2375)
  108462fc chore: move trial hyperparameters in a dedicated trial page tab [DET-5412] (2364)
  3197cc32 build: remove webui and docs as direct master dependencies (2363)
  3a545cdd feat: add a preview parameter to the notebook launch API (2359)
  fd145c92 chore: remove redundant model_hub line from bumpversion. (2374)
  1f573da8 docs: Release notes for 0.15.4. (2370)

0.15.5 not secure

eb1d8215 docs: Release notes for 0.15.5. (2397)

0.15.5rc1 not secure

a0f8ae6c docs: deprecate old master configuration fields (2395)

0.15.5rc0 not secure

6c51726f chore: close agents on websocket closures (2380)

0.15.5.dev0

Docker images
  
  - `docker pull determinedai/determined-master:0.15.5`
  - `docker pull determinedai/determined-master:5fe959f6`
  - `docker pull determinedai/determined-master:5fe959f61237b90b6af68999440fe6f52f734492`
  - `docker pull determinedai/determined-dev:determined-master-5fe959f6`
  - `docker pull determinedai/determined-dev:determined-master-5fe959f61237b90b6af68999440fe6f52f734492`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:0.15.5`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:5fe959f6`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:5fe959f61237b90b6af68999440fe6f52f734492`

0.15.4 not secure

664452a9 docs: Release notes for 0.15.4. (2370)

0.15.4rc1 not secure

6a70ea30 fix: fix task pagination filters not taking effect [DET-5442] (2367)

0.15.4rc0 not secure


        

0.15.4.dev0

1d85f24d chore: add unit tests for webui util functions [DET-5323] (2347)
  984ad9d4 feat: add workload status to trial infobox [DET-4289] (2349)
  2afc2e10 fix: ci/cd model-hub tests config (2358)
  f3f828b8 chore: fixes eslint error (2361)
  2f363ac3 chore: reorder migrations (2362)
  4a66da38 refactor: delete some old commands APIs (2321)
  9b5bc160 fix: ci/cd e2e tests timeout (2353)
  e37029f8 test: always calling read() before calling wait() (2356)
  d8c89211 feat: store the original user submitted experiment config in db (2332)
  f677fa71 feat: support transformers library in model-hub [DET-4823, 4719, 4721, 4720] (2068)
  fc27d779 fix: improvements to automatic pod spec configurator (2306)
  f6f13dc1 fix: hide expected network errors when nodes are terminated [DET-4822] (2351)
  63c77f27 chore: Add output printing to debug flaky test (2350)
  0f3be84c chore: drop prior_batches_processed and num_batches (2345) [DET-5403, DET-5405]
  1161b667 fix: fix test test cluster setup cmd. (2341)
  9de56d6b chore: migrate to use total_batches more in HP search viz. (2344)
  cceb7647 fix: react build should depend on its public dir (2339)
  0ddac86d chore: edits to expconf before enabling it (2342)
  cd2980b0 refactor: provide support for specifying selector for element id for element list (2255)
  dc229e58 feat: tolerate missing GPU stats when running under MIG [DET-5387] (2327)
  4160745d chore: disable webui experiment archive test (2340)
  8a0ced99 feat: add internal searcher APIs [DET-5214] (2301)
  cbafb09b feat: replace "show archived" toggle with dropdown [DET-3925] (2333)
  5952d379 fix: improve uPlot chart zooming experience [DET-5395] (2338)
  97ed53c5 chore: add searcher type to output of experiment APIs (2328)
  434578b0 docs: Release notes for 0.15.3 (2334)
  e77a16a1 chore: fix docstring (2337)
  5d608656 chore: add viewport meta to improve WebUI mobile experience [DET-5396] (2335)
  a0242e7d fix: system metric chart fix to support milliseconds [DET-5348] (2311)
  dca96c23 chore: sort nulls last in experiment trial API (2329) [DET-5300]
  cfb46abe chore: go mod fixes (2325)
  916b75ca chore: only select a single host port per container rendezvous port (2331)
  aafdcf0f chore: update package json [DET-5335] (2314)
  20263dc3 chore: use filelocks to guard data download (2244)
  4f12a6fd chore: loosen ruamel.yaml version (2313)
  7e6f51a4 revert: "revert: "fix: gracefully handle Docker binding published ports to ipv4 and ipv6 for host (2259) [DET-5295]" (2326)" (2330)
  1df87b2a docs: add a missing word in react readme (2324)
  953f5289 Revert "fix: gracefully handle Docker binding published ports to ipv4 and ipv6 for host (2259) [DET-5295]" (2326)
  2fa12d85 chore: update Docker images, AMIs and harness for yogadl update (2319)
  0d4cb140 ci: move CUDA 11 testing to more available GPUs (2316)

0.15.3 not secure


        

0.15.3rc3

b479d891 docs: Release notes for 0.15.3 (2334)

0.15.3rc2 not secure

8ced4c3d chore: go mod fixes (2325)
  63e8e11a chore: only select a single host port per container rendezvous port (2331)
  6b33c762 chore: loosen ruamel.yaml version (2313)

0.15.3rc1

f188b8dc chore: update Docker images, AMIs and harness for yogadl update (2319)

0.15.3rc0

b6d1bef1 chore: update environment images to 0.12.0. (2304)

0.15.3.dev0

Docker images
  
  - `docker pull determinedai/determined-master:0.15.3`
  - `docker pull determinedai/determined-master:b42d42bd`
  - `docker pull determinedai/determined-master:b42d42bdb1e66daadb0dc1a2dc8454b072bab774`
  - `docker pull determinedai/determined-dev:determined-master-b42d42bd`
  - `docker pull determinedai/determined-dev:determined-master-b42d42bdb1e66daadb0dc1a2dc8454b072bab774`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:0.15.3`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:b42d42bd`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:b42d42bdb1e66daadb0dc1a2dc8454b072bab774`

0.15.2 not secure

565e1876 docs: Release notes for 0.15.2 (2303)

0.15.2rc2 not secure

4527077f fix: scary warning with det shell open (2299)
  60221140 docs: add release notes for preemption in k8s (2294)
  56e87c2a fix: correct logic for hasData in uPlotChart (2292)

0.15.2rc1 not secure


        

0.15.2rc0 not secure

76fe5c8c fix: wait for uPlot to be ready to setData or setSize [DET-5343] (2283)
  c367fc77 feat: promote custom reducers from experimental [DET-5322] [DET-5321] (2284)
  f3636f3a docs: add docs for preemption in kubernetes (2289)
  88ca463e feat: allow activation of priority scheduler in k8s (2288)
  ae13bb6c fix: lr_scheduler step when using gradient_aggregation [DET-5289] (2271)
  c1fa9234 feat: add support for preemption in Kubernetes [DET-5135] (2282)
  54ff4ae6 fix: only warn for non-numeric np.dtypes [DET-5288] (2287)
  1f3b5e05 chore: remove svg and ttf fonts (2286)
  1454a155 chore: drop unused compose component (2278)
  1dc19d74 fix: squelch "response already committed" master log message (2281)
  b3c33a50 docs: add missing line to docker run (2285)
  7200fdf7 feat: expose user id as part of the user object [DET-4856] (2265)
  bea7875d fix: add support for dynamic section content via css [DET-5299] (2277)
  e1f14ba2 fix: improve rendering for uPlot chart with empty data [DET-5330] (2274)
  3b79dc57 chore: upgrade to labstack/echo v4.2.2 (2266)
  7239b108 chore: add support of y-axis zooming for uPlot [DET-5266] (2268)
  4b627002 expconf: fix some minor bugs in reflect code (2267)
  4bf88ba6 chore: fix typo in help for "experiment download". (2269)
  6c686929 fix: gracefully handle Docker binding published ports to ipv4 and ipv6 for host (2259) [DET-5295]
  6f5b86d3 ci: enable taiko get elements logging to help debug the disconnect from this and actual elements (2264)
  24ca0b50 fix: ignore hp-importance as a requirement for displaying hp-viz (2258)
  438a07ad fix: allow agents to be set to empty [DET-5296] (2261)
  56e6f0f8 chore: migrate webui to use /api/v1/auth/login  [DET-5287] (2254)
  97122918 chore: replace metric chart with uplot [DET-4303] (2234)
  468a70fe chore: clarify API for expconf objects (2256)
  524d3a3d chore: add option to login through the new api w/ pre-hashed pwd [DET-5270] (2253)
  d08a4492 chore: remove validation operations [DET-5213] (2189)
  cd64901f chore: add compression middleware to echo (2249)
  f18cb7dc docs: Release notes for 0.15.1 (2245)

0.15.2.dev0

3c2186ef Fix: remove quotes for Terraform 0.13 (2231)
  d23ebd05 docs: missing service-linked role [DET-5253] (2221)
  3cf12f4c feat: support configurable port and container name for Fluent Bit [DET-5272, DET-5273] (2251)
  6827390c chore: set the image used in ptl amp test through set_tf2_image (2194)
  259cff6a chore: trigger hp importance work on exp completion (2248)
  fe1540d4 chore: no pointers to maps or slices in expconf (2238)
  bd509a45 fix: `TFKerasTrial` check for tf2 behavior on 2.2.0 [DET-5277] (2246)
  afdb7495 feat: per-resource-pool configs [DET-5173] (2214)
  30a6af36 chore: reset error field on hp importance success (2242)
  6c47b6c9 style: transpose hp heatmap to better align the plot axes (2232)
  55cfe8f7 ci: fix windows test with lmdb 1.2 (2241)
  eb616933 fix: update label picker when labels change [DET-5254] (2239)
  6ab22682 chore: submit partial hp importance work to pool (2240)
  c2f96cbc chore: allow dev lint errors (2218)
  8de19c06 chore: fix panic from dependency creation race (2233)
  782d095a fix: select snapshot version with snapshot (2235) [DET-5264]
  cac640b8 chore: up circle ci timeout for e2e tests (2237)
  0b2c2c7e fix: actually support add/drop capabilities in structs (2236)
  1c926e03 chore: fix panics in hp importance actor (2230) [DET-5263]
  e21dc318 feat: support healthchecks for `det deploy aws` with TLS enabled. (2207)
  88b86894 docs: Release notes for 0.15.0. (2225)

0.15.1 not secure

a18a1eb0 Fix: remove quotes for Terraform 0.13 (2231)

0.15.1rc0 not secure

5bc5826d chore: fix panic from dependency creation race (2233)
  2f73fbc3 chore: fix panics in hp importance actor (2230) [DET-5263]
  5dbee57a fix: select snapshot version with snapshot (2235) [DET-5264]
  922ff056 fix: TFKerasTrial on tf2 with tf.compat.v1.disable_v2_behavior. (2211)

0.15.1.dev0

Docker images
  
  - `docker pull determinedai/determined-master:0.15.1`
  - `docker pull determinedai/determined-master:0e002898`
  - `docker pull determinedai/determined-master:0e002898037e6a58ec764e42d5f4a611c35a718b`
  - `docker pull determinedai/determined-dev:determined-master-0e002898`
  - `docker pull determinedai/determined-dev:determined-master-0e002898037e6a58ec764e42d5f4a611c35a718b`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:0.15.1`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:0e002898`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:0e002898037e6a58ec764e42d5f4a611c35a718b`

0.15.0 not secure

3fc0fa60 docs: Release notes for 0.15.0. (2225)

0.15.0rc1 not secure

e7576a74 fix: force checkpoint GC to use the master's default environment (2229)
  1fa876b2 fix: allow scatter plot to re-render when data changes initially (2224)
  6c1d0130 chore: validate grid list enum value from local storage (2223)

0.15.0rc0 not secure


        

0.15.0.dev0

99147a31 chore: lock api state for backward compatibility check
  418d49ae chore: bump to 2 agents on latest master [DET-5241] (2219)
  35ae0800 test: add more lr scheduler tests for lightning [DET-5223] (2184)
  93ae7559 chore: stop using cloudpickle to write PyTorch checkpoints [DET-5175] (2204)
  631dce25 docs: Various fixes (2209)
  0397c1c7 feat: add git and ide content to detignore by default [DET-2832] (2210)
  52e57558 chore: pull EE CLI features and docs into OSS [DET-3912] (2195)
  40b414c4 feat: move executables to the main package and update docs. (2187)
  26404b83 chore: Update to Ubuntu 20.04 for agent, master and bastion images [DET-5238] (2208)
  154e1c6a docs: clarify k8s default pod spec behavior (2197)
  f091d5fc chore: avoid rerendering experiment list if api response remains the same (2203)
  364c0bc2 chore: show trial metrics on webui [DET-5060] (2167)
  c648f157 chore: update estimators test fixture to not reference adaptive searcher (2205)
  225ccde5 chore: remove sha [DET-5225] (2181)
  469fb25a refactor: consolidate global contexts (2186)
  66000fcd fix: disable det deploy wait for aws cluster on circleci. (2192)
  7de74411 chore: extend timeout on HP importance test (2193)
  3ccd0ca8 chore: make the first glasbey color our brand color (2190)
  5e68b752 feat: add key tracker (2188)
  376baa3b feat: health check master after cluster creation [DET-5183] (2164)
  5a35fc1e chore: enable hyperparameter importance computation by default (2159)
  6aa7a047 refactor: improve experiment terminal state [DET-5202] (2179)
  bf43f63a chore: remove stoksc from codeowners (2185)
  92056b56 fix: add efs, fsx, and govcloud templates to bumpversion [DET-5200] (2172)
  4cd06121 fix: mmdetection docker image to work with torch 1.7 (2183)
  d9790925 feat: local clusters to store checkpoint data in home [DET-5154] (2170)
  19ea6079 fix: bug in random search leading to incorrect total trials (2182)
  81276813 chore: idempotent searcher progress API [DET-5211] (2180)
  b60ac4ce feat: zoomed modal charts [DET-5111] (2174)
  fa9c7736 chore: `make` default goal should be `build`. (2177)
  d241fa96 docs: various fixes (2163)
  66f75d46 fix: allow telemetry to be disabled under Helm (2178)
  07ce81af fix: default tooltip prefix to be an empty string (2165)
  bb7f1ea7 chore: tweak step_lr param in e2e tests (2160)
  d96aced2 chore: add an example for checkpoint callbacks [DET-5186] (2173)
  ad3e0a4d refactor: update context api to reduce unnecessary re-renders [DET-5185] (2168)
  255fa74f chore: store the daily/monthly filter setting in local storage for cluster historical usage page [DET-5194] (2161)
  9e3c69d6 chore: wire up profiling configurations [DET-5064] (2122)

0.14.7.dev0

8a458c11 chore: add tab navigation to trial details page [DET-5070] (2162)
  14e9911d feat: `det deploy` check for sufficient gpu quotas on aws, gcp. (2136)
  f0faf474 chore: webui for resource allocation data [DET-5046] (2062)
  c636b453 chore: update codeowners (2145)
  4b7ef379 fix: add terraform files into default detignore [DET-5155] (2146)
  3537c21b fix: get cli wheel back into trail runner. (2156)
  e9892dde fix: fix an issue in wrapping lr_scheduler for lightningadapter (2154)
  8adc237e fix: roll back `det` and `det-deploy` executable move. (2153)
  5c8a17fb fix: avoid loop of effect in hp-viz when experiment is not supported [DET-5189] (2151)
  95a86385 fix: e2e test for pytorch lightning examples (2152)
  d6bccf60 fix: pytorch lightning example (2150)
  323f2729 fix: avoid showing no-data message for a split second in hp-viz [DET-5099] (2144)
  957ffd6b chore: add license information to Pytorch Lightning examples (2147)
  
  
  
  Docker images
  
  - `docker pull determinedai/determined-master:0.15.0`
  - `docker pull determinedai/determined-master:3a04e697`
  - `docker pull determinedai/determined-master:3a04e697706f25e6068b2bfe0f4ff3d9c8332ec9`
  - `docker pull determinedai/determined-dev:determined-master-3a04e697`
  - `docker pull determinedai/determined-dev:determined-master-3a04e697706f25e6068b2bfe0f4ff3d9c8332ec9`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:0.15.0`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:3a04e697`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:3a04e697706f25e6068b2bfe0f4ff3d9c8332ec9`

0.14.6 not secure

29d9988a docs: Release notes for 0.14.6. (2158)

0.14.6rc4 not secure

2e8c3645 fix: get cli wheel back into trail runner. (2156)

0.14.6rc3 not secure

0eaa03a8 fix: fix an issue in wrapping lr_scheduler for lightningadapter (2154)
  a451c2c9 fix: roll back `det` and `det-deploy` executable move. (2153)

0.14.6rc2 not secure

8024fc58 fix: avoid loop of effect in hp-viz when experiment is not supported [DET-5189] (2151)
  05645afa fix: e2e test for pytorch lightning examples (2152)
  78c7fe9b fix: pytorch lightning example (2150)
  9d5dff71 chore: add license information to Pytorch Lightning examples (2147)

0.14.6rc1 not secure


        

0.14.6rc0 not secure

1722f288 feat: precision prop and amp support for lightning adapter [DET-5116] (2127)
  567e237f feat: add allocation aggregation by agent label and resource pool (2141)
  cd39ce72 chore: upgrade taiko [DET-5157] (2134)
  17f77cad perf: support max_concurrent_trials for random and grid search (2137)
  ae505e48 ci: adding coscheduler to static k8s test clusters, and test (2139)
  f5723da5 chore: move unets_tf_keras back to previous images (2138)
  ec8f02bf chore: change output format of JSON aggregated resource data (2129)
  db16db4c style: trial log section filters [DET-5176] (2133)
  23b0945b fix: tweak aggregated resource allocation history endpoint (2123)
  b9d6b0c5 chore: downgrade dev pytorch package versions to 1.7.1 (2135)
  74be8a0e chore: Moving back to Python 3.7 and PyTorch 1.7.1 (2132)
  0b027d22 fix: package install order in requirements.txt (2131)
  169ad89c chore: ingest multiple batches to trial profiler metrics endpoint [DET-5178] (2117)
  eb45d53e fix: update scatter plot to support non-numeric values [DET-5110] (2126)
  8e130717 feat: rank hparams with hp importance [DET-5105] (2086)
  f172ed56 docs: improvements to spot instance and resource pool docs (2113)
  dc623d0b refactor: include det-deploy into det cli. [DET-5153] (2124)
  88044b08 chore: add webui tests lint step to CI (2115)
  59f48f3b feat: add pytorch checkpoint on load/save hooks [DET-5109] (2118)
  6f58a932 ci: put GPUs in a separate GKE node pool from master (2120)
  7e5c9c15 chore: remove cluster v1 page [DET-5163] (2114)
  6cb9445e chore: add server address to cli trial log download cmd [DET-5161] (2116)
  02598ce9 refactor: cleanup react hook dependencies [DET-5158, DET-5159, DET-5160] (2112)
  b0b03b2e style: update hp viz nav (2093)
  4b847724 refactor: combine common, cli, deploy into one python package. [DET-4756] (2108)
  b531bda5 build: local build improvements [DET-5118] (2060)
  202c4857 feat: add trial profiler metrics APIs [DET-5065, DET-5059] (2051)
  fe87adcb chore: add new ExperimentConfig objects (2066)
  3d2f54fa chore: add frequency parameter to wrap_lr_scheduler [DET-5148] (2087)
  18c39947 chore: add pytorch-lightning to docs requirements (2111)
  60ab3d36 chore: Revert "Testing gang-scheduling [DET-5134]" (2110)
  52a7fb32 chore: update to new images including TensorFlow, PyTorch, Python and CUDA upgrades (2074)
  2c5beaad Testing gang-scheduling [DET-5134] (2100)
  08d9562b feat: expose resource allocation endpoints in CLI [DET-5045] (2107)
  c346301e feat: colorize output info of det-deploy [DET-4749] (2102)
  c42069d4 feat: add aggregated resource allocation endpoint and job [DET-5044] (2085)
  9116b369 docs: Release notes for 0.14.5. (2098)
  6470983e docs: Release notes for 0.14.4. (2089)

0.14.6.dev0


        

0.14.5 not secure

493f0370 docs: Release notes for 0.14.5. (2098)

0.14.5rc1 not secure

2e2cceb1 Revert "chore: bump version: 0.14.5rc0 -> 0.14.5"
  2d16658e docs: further improve k8s coscheduling docs (2099)
  c4508da1 fix: broken doc links [DET-5100] (2101)

0.14.5rc0 not secure

0d861f0a feat: add batch margins [DET-5073] (2057)

0.14.5.dev0


        

0.14.4 not secure

515162c1 docs: Release notes for 0.14.4. (2089)
  
  
  
  Docker images
  
  - `docker pull determinedai/determined-master:0.14.5`
  - `docker pull determinedai/determined-master:f16dc9f1`
  - `docker pull determinedai/determined-master:f16dc9f1191a6e9b1b5c992ac39c6761ed176e20`
  - `docker pull determinedai/determined-dev:determined-master-f16dc9f1`
  - `docker pull determinedai/determined-dev:determined-master-f16dc9f1191a6e9b1b5c992ac39c6761ed176e20`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:0.14.5`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:f16dc9f1`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:f16dc9f1191a6e9b1b5c992ac39c6761ed176e20`

0.14.3 not secure


        

0.14.3rc3 not secure

710319b7 fix: support trial filtering based on categorical hps [DET-5107] (2045)
  c94c9f5d fix: trial log viewer scrolling issues [DET-5096] (2044)
  3010fd52 fix: nas dtrain bug (2039)
  51bf576e chore: tweak resource allocation endpoint (2040)
  9e409b27 docs: Release notes for 0.14.3. (2046)

0.14.3rc2 not secure

01680a48 fix: trial log viewer disappearing when re-clicking direction button [DET-5097] (2043)
  37f8d812 fix: decode categorical config hp properly [DET-5098] (2042)
  af5c6b81 fix: correct polling issues [DET-5095] (2036)
  37b0b757 docs: fix version of CUDA 11 image (2037)
  6404731e chore: remove petname dependency (2035)

0.14.3rc1 not secure


        

0.14.3rc0 not secure

26fc1a91 chore: lock api state for backward compatibility check
  b5b7a28b feat: add endpoint for raw resource allocation information [DET-5043] (2026)
  229a60ca feat: scatter plot and heat map [DET-4453, DET-4459, DET-5085] (2007)
  e7cd561f chore: support new tags and gov images in bumpenvs (2033)
  1f6e3a5b fix: bug in deformable detr experiment config (2031)
  231ed9ec chore: warn on failures to connect to persitent storage (2030)
  0e48beca fix: update experiment trials endpoint when changing filters or sorting (2029)
  a3d168ba feat: HP importance implementation [DET-4465] (1965)
  2992d5b1 fix: prevent stray logs during checkpoint loading (2028)
  90e4d4f3 feat: Support PyTorch-native Automatic Mixed Precision [DET-4753] (1914)
  1423f1e8 fix: fix shim for stateless searchers (grid, single, etc..) (2004)
  744c4ad1 fix: detect parcoords filter removal properly (2022)
  867dd841 feat: deformable detr with coco (1817)
  4806f1bc refactor: DETR example to use custom reducer and support finetuning (1816)
  623ab49c fix: fix forking failure on k8s [DET-5087] (2021)
  060a2b42 chore: exclude useless GPU type in resource pool API [DET-5079] (2016)
  b42dc3af docs: default agent docker image (1977)
  05d8fcd5 chore: update contributing.md with helm (2019)
  717b7277 docs: adding page for custom k8s master [DET-5022] (2013)
  afbb982c fix: navigation disappearing when going back from wait page [DET-5020] (2018)
  9c85ce11 chore: webui uniform path generation [DET-4734] (2010)
  6cb259eb chore: make experiment list page url sharable [DET-4745] (1999)
  43107475 refactor: remove app contexts [DET-2878, DET-4820] (1996)
  dd648f3f fix: remove react extra get-deps dependency target (2015)
  216f5cab feat: example using hp constraints for NAS (2014)
  66c55591 docs: HP Search Constraints Documentation [DET-4392, DET-4993] (1998)
  5542e115 build: build swagger api bindings [DET-5056] (2011)
  33252c14 chore: refactor cifar10_tf_keras example (2006)
  f5ca28b5 fix: fix deprecation warning on using ABCs from 'collections' (1997)
  fdc7c96d fix: fix query for GET experiments [DET-5039] (2012)
  02288260 fix: use UTC for all logs (1985)
  e00ab757 fix: correct batches column order on trial detail page [DET-5023] (2003)
  4a386cb7 docs: add link for enabling Kubernetes GPU support (2008)
  b397cfdf fix: fix v0 experiment snapshot shim (2002)
  28247048 chore: update cluster address for react preview [DET-5008] (1989)
  e085b91d feat: use anchor tags for navigation in tables rows [DET-4746] (1990)
  81d78004 ci: bump gke version (2000)
  96203a20 docs: fix missing parens in a few checkpoint API code snippets (2001)
  97632552 docs: add documentation regarding limit behavior for pagination (1995)
  ba47fe04 fix: call Sequence.on_epoch_end after validation (1991)
  19365f60 build: add instructions to point webui to cors disabled clusters (1988)
  c0e3a669 chore: refactor trial log viewer to improve rendering performance [DET-4866] (1974)
  b4f3be3c feat: enable parcoords in webui [DET-5013] (1994)
  3c18d5dc chore: tune parcoords [DET-4991] (1973)
  e4f010d5 feat: add v0 experiment config objects in python (1966)
  1c0cbdc7 chore: support task tokens in harness for authentication [DET-4897] (1894)
  b98ad557 chore: remove searcher emitted checkpoints [DET-4996] (1972)
  2ea6d16a chore: fix typo in comments (1986)
  7af2c1e7 feat: HP constraints harness exception handling [DET-4867] (1875)

0.14.3.dev0

9043ebda docs: Release notes for 0.14.2. (1983)
  a93fbfcb chore: remove parcoords temporarily (1982)
  dabbea46 fix: fix notebook state stuck in pending [DET-4988] (1981)
  e6fd27ae chore: check if experiment exists in /api/v1/experiments/:id/trials (1978)
  666fed0c fix: k8s resource pool API response should not have nil field [DET-4989] (1979)
  6e752136 fix: fix quoted string bugs for non-simple AWS deployments [DET-5001] (1980)
  
  
  
  Docker images
  
  - `docker pull determinedai/determined-master:0.14.3`
  - `docker pull determinedai/determined-master:d28d851d`
  - `docker pull determinedai/determined-master:d28d851d4f6409660103a4ba29c39e2fcf71c499`
  - `docker pull determinedai/determined-dev:determined-master-d28d851d`
  - `docker pull determinedai/determined-dev:determined-master-d28d851d4f6409660103a4ba29c39e2fcf71c499`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:0.14.3`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:d28d851d`
  - `docker pull nvcr.io/isv-ngc-partner/determined/determined-master:d28d851d4f6409660103a4ba29c39e2fcf71c499`

0.14.2 not secure

60456943 docs: Release notes for 0.14.2. (1983)

0.14.2rc3 not secure

fa45890f fix: fix notebook state stuck in pending [DET-4988] (1981)
  90256ac1 chore: remove parcoords temporarily (1982)

0.14.2rc2 not secure

bfb68050 fix: fix quoted string bugs for non-simple AWS deployments [DET-5001] (1980)
  68e4513a fix: k8s resource pool API response should not have nil field [DET-4989] (1979)
  e50cdcb3 chore: check if experiment exists in /api/v1/experiments/:id/trials (1978)

0.14.2rc1 not secure


        

0.14.2rc0 not secure

3df38beb fix: helm chart template dashes (1976)
  9592b64e ci: deploy NGC images as part of release [DET-4910] (1941)
  10086456 chore: fix squad example README links (1975)
  94b214f9 chore: lock api state for backward compatibility check
  5e3cb065 docs: Release notes for 0.14.1.

0.14.2.dev0


        

0.14.1 not secure


        

0.14.1rc2 not secure

0c229de4 fix: make trial log timestamp filters backwards compatible (1944)

0.14.1rc1 not secure


        

0.14.1rc0


        

0.14.1.dev0

6a902179 docs: Release notes for 0.14.1.
  3e00128a fix: add backwards compatability for logs before 0.13.8 (1942)
  db67b27a docs: More changes to release notes for 0.14.0. (1927)
  
  
  
  Docker images
  
  - `docker pull determinedai/determined-master:0.14.1`
  - `docker pull determinedai/determined-master:875429b1`
  - `docker pull determinedai/determined-master:875429b1b96bedcdd0a15bbb5f40a1957e00ee6e`
  - `docker pull determinedai/determined-dev:determined-master-875429b1`
  - `docker pull determinedai/determined-dev:determined-master-875429b1b96bedcdd0a15bbb5f40a1957e00ee6e`

0.14.0 not secure

e7da518d docs: Minor edits to release notes for 0.14.0.
  31c3ad4d edit
  b09f452c edit
  3f57533b Tweak release notes.
  90cdf6ec Tweaks for release notes.
  def0d9b5 docs: Release notes for 0.14.0.

0.14.0rc4 not secure

55a3c221 chore: revert default images and framework versions (1936)

0.14.0rc3 not secure

94820c35 docs: edit model debug doc (1925)
  ef702dc4 fix: correct the comparison function when numbers are fractions [DET-4969] (1924)
  8369e119 refactor: paginate experiment trials [DET-4900, DET-4921, DET-4922] (1892)
  017764f7 fix: correct cancel confirm button label to confirm [DET-4966] (1922)
  77740faa fix: buffer the trial log in the correct order [DET-4931] (1912)

0.14.0rc2 not secure

21c704d7 chore: improve resource pool details presentation [DET-4968] (1926)
  836414be fix: remove clickable style from trial info table [DET-4967] (1923)
  b03a1142 fix: add default query limit and add missing sort by state [DET-4919] (1921)
  fd94f24f docs: fix incorrect reference in docs (1919)
  7a1f0510 fix: typos in model debugging doc (1918)
  2b0031ee fix: fix a utilization calculation error in hgi resource bar for cpu slots [DET-4913] (1911)
  5de2710f docs: clean up resource pool docs and add release notes (1917)
  4f6b6ba8 feat: support resource pools in det-deploy local agent-up [DET-4938] (1906)
  6c3150be refactor: update active experiments [DET-4915] (1910)

0.14.0rc1 not secure

024b9fa2 fix: correct best and latest metric sort by params for the GET experiment trials API (1915) [DET-4920]
  733dca88 fix: add non-scalar metric expectation to protobufs [DET-4893] [DET-4911] (1876)
  fb4e119a feat: support more fields to sortBy in /api/v1/experiments/trials [DET-4219, DET-4920] (1899)
  a152fd32 chore: Bump images and versions to Tensorflow 2.4.1 (1913)
  099d5b2a chore: let CLI verify the master using combined system/custom certs (1859) [DET-4666]
  e25c0541 docs: add model debug doc (1895)
  c0dd89ec chore: reword resource pool ui presentation [DET-4925] (1898)

0.14.0rc0 not secure


        

0.14.0.dev0 not secure


        

0.13.14


        

0.13.14rc0

22be8c57 revert: "Revert "fix: migrate trial log ID to bigint (1792)" (1901)" (1902)
  1e0948d8 Revert "fix: migrate trial log ID to bigint (1792)" (1901)
  4a262bcb chore: save user preference for cluster view [DET-4926] (1896)
  63abc669 fix: migrate trial log ID to bigint (1792)
  142b9d1b feat: show resource pools without connected agents [DET-4924] (1897)
  6987f340 chore: move task messages to sproto (1891)
  24a12bc1 docs: add topic guide for commands and shells (1886) [DET-4901]
  c353fa1c feat: Documentation and CI of support for NVIDIA A100's and Google A2 instances (1888)
  fa85d33b chore: fix type errors in IPC code (1885)
  058ba7d4 docs: improve resource pool docs (1865)
  702add81 chore: update trials API name (1873)
  e434619a fix: render resource pools in order (1883)
  55885bc1 chore: Upgrading environment and dependencies to PyTorch 1.7 and TensorFlow 2.4 (1851)
  ea9abae4 fix: don't lose logs of short-lived commands (1882) [DET-4907]
  2b8a99cd fix: trial hangs when it fails to write to the DB (1877)
  d0b88fe4 fix: allow NULL trials.request_id for backwards compat (1881)
  434094bb ci: tolerate longer time for concurrent log uploading [DET-4908] (1879)
  facd7217 ci: fix kubernetes configuration resolution [DET-4909] (1880)
  fbe48c82 fix: CI failures caused by resource pool merge (1864)
  9a0d051e perf: move experiment API filtering and pagination to database [DET-4770] (1803)
  6be4a494 fix: altered tf.config function call to be compatible with tf 1.15 [DET-4852] (1836)
  e61415fc build: fix for `make check-schemas` on non-GNU build machines (1871)
  d4a6a9a1 chore: update CLI commands to display resource pool [DET-4677] (1709)
  85267b54 docs: clarify preemptible instance doc for static and dynamic agents (1870)
  811b7dc3 chore: expose det-deploy AWS profile support [DET-4891] (1868)
  4bc23a9d chore: restart in-progress HP importance computation on master restart [DET-4675] (1844)
  d114d469 chore: remove the deprecated PyTorch API [DET-3262] (1784)
  d97735f5 fix: allow larger gRPC response bodies (1869)
  f9876029 feat: enable new hgi-aware cluster page [DET-4854] (1855)
  7d4f2487 build: avoid re-downloading codegen binary (1838)
  916be666 fix: handle learning curve edge cases [DET-4832] (1827)
  3810fc51 ci: check that Go dependencies are tidied (1852)
  93332bf6 test: print start time for each E2E test (1867)
  b7721a40 fix: order migrations in the order they landed (1866)
  71aa544e expand details in resource pool modal  [DET-4884] (1857)
  54fd6fc4 feat: add resource pools (1846)
  05266cde docs: Release notes for 0.13.13. (1843)

0.13.14.dev0

5e88f2bd feat: swap master restart to be snapshot based (1745) [DET-816]
  0db30c6b fix: don't set default trial log limit in CLI (1856)
  a8d4dd2e fix: fix CLI log tailing with elastic (1853) [DET-4883]
  354cdfa8 fix: another place scheduler config for resource pool not being inherited (1854)
  831235eb feat: add resource pool column to tasks list (1831)
  a1821c84 feat: add resource pool column to experiment list (1819)
  ae8011ce fix: webui trial logs should not use negative offset (1845)
  d812ab78 chore: connect HGI UI to its API [DET-4638] (1837)
  4fb65a4f fix: scheduler config for resource pool not being inherited (1847)
  6523a8c5 chore: update cluster utilization overview [DET-4346] (1788)
  9f57c9d2 docs: fix readme to clarify gpu vs cpu
  5f661c86 chore: add custom error for torch's ReduceLROnPlateau (1849)
  2617e74f chore: bump taiko-video version to fix ffmpeg / screenshot save race condition (1850)
  2f2e5ee7 perf: index as few log fields as possible to increase elasticsearch ingest speed (1848)
  c6b1f0ed fix: increase trial log timestamp resolution to support milliseconds [DET-4861] (1841)
  f9d31f93 chore: enable some more Go linters (1839)
  b4b1fe20 chore: retry for more errors when uploading to GCS (1794)
  2d2e96e2 chore: fix duplicates in elastic log ids (1834)
  79fe5eaf chore: add missing apiKey update to internal streaming sdk (1833)
  57a9acc7 chore: update storybook to resolve github security vulnerability for highlight.js (1808)
  342527cd chore: fix trial log following logic (1832) [DET-4850]
  91e2800b chore: Endpoint and infrastructure for hyperparameter importance computation [DET-4464] (1707)
  adc4361e chore: experiment API returns resource pool info [DET-4572] (1711)
  9a6da7af chore: fix ExitedReason log (1829)
  cf3accbd fix: dars_penntreebank_pytorch example [DET-4841] (1822)
  513136d5 fix: show zoom out tip when zoomed into learning curve chart (1828)
  128531e9 chore: Revert DET-4688, do not support single-trial experiments in trials-sample endpoint [DET-4840] (1824)
  0b243346 fix: update model def button to be a raw link (1826)
  79df2101 chore: various elastic fixes (1825) [DET-4839]
  b8d9e20d fix: update types to support new log levels (1823)
  9ae41f51 fix: revert broken user-facing change with experiment config logic (1821)
  b0214984 docs: Add Lunch and Learn promotion to README.md (1815)
  
  
  
  Docker images
  
  - `docker pull determinedai/determined-master:0.14.0`
  - `docker pull determinedai/determined-master:9ee2fa43`
  - `docker pull determinedai/determined-master:9ee2fa4321ff127bd0a08a90d15fa524d73b597c`
  - `docker pull determinedai/determined-dev:determined-master-9ee2fa43`
  - `docker pull determinedai/determined-dev:determined-master-9ee2fa4321ff127bd0a08a90d15fa524d73b597c`

0.13.13 not secure

4da0d0e3 docs: Release notes for 0.13.13. (1843)

0.13.13rc5 not secure

f4471de6 fix: don't set default trial log limit in CLI (1856)
  ec928446 fix: fix CLI log tailing with elastic (1853) [DET-4883]
  c5513cf7 fix: another place scheduler config for resource pool not being inherited (1854)

0.13.13rc4 not secure

e2a7846a perf: index as few log fields as possible to increase elasticsearch ingest speed (1848)
  0a164fc4 fix: scheduler config for resource pool not being inherited (1847)
  c4dc32fb fix: webui trial logs should not use negative offset (1845)

0.13.13rc3 not secure

64a9ff2e chore: fix duplicates in elastic log ids (1834)
  882b7f96 chore: fix trial log following logic (1832) [DET-4850]
  376619a4 chore: add missing apiKey update to internal streaming sdk (1833)

0.13.13rc2 not secure


        

0.13.13rc1 not secure

60acd7db fix: dars_penntreebank_pytorch example [DET-4841] (1822)
  8dba87e3 fix: show zoom out tip when zoomed into learning curve chart (1828)
  9506c879 chore: Revert DET-4688, do not support single-trial experiments in trials-sample endpoint [DET-4840] (1824)
  14aa2430 fix: update model def button to be a raw link (1826)
  6ce28821 chore: various elastic fixes (1825) [DET-4839]
  a69fc054 fix: update types to support new log levels (1823)
  2b1790c8 fix: revert broken user-facing change with experiment config logic (1821)

0.13.13rc0 not secure

7fb39269 chore: lock api state for backward compatibility check
  6c1840eb chore: webui support elastic search trial logs [DET-4616] (1801)
  0dd69003 fix: requested stops shouldn't be treated as errored (1818)
  1bcda695 chore: Update bumpversion (1820)
  7808e6fa chore: return timestamp and level per log from trial logs API [DET-4825] (1814)
  0faa6772 docs: add doc for det-deploy aws list (1813)
  9dacff06 feat: enable learning curve [DET-4776, DET-4792] (1796)
  955177e3 fix: the examples that use the old Pytorch APIs (1810)
  b783761f chore: TF RNG in Estimators test would require a session [DET-4624] (1811)
  5296282b feat: det-deploy aws list (1790)
  cf40ffd1 chore: fix typos in release notes (1812)
  05210e01 chore: change API response fields from placeholders to empty strings (1809)
  9e687e50 fix: CI failing due to PR 1724 (1807)
  e3ab038e docs: release notes for 0.13.12 (1782)
  0a89c124 chore: remove tensorpack test [DET-4790] (1793)
  204d3639 feat: add HGI table view [DET-4634 DET-4637] (1778)
  fbec0fd6 refactor: update default user filters to All when user is an admin (1799)
  bcc8ef0e ci: reduce webui e2e-tests flakes [DET-4973] (1798)
  20f5fe14 feat: add API endpoint that returns information about the resource pool (1724)
  77df93f9 docs: deprecate old PyTorch API (1783)
  6608389a fix: always encode trial ID as string [DET-4789] (1800)
  9307710a feat: add metric column to learning curve table (1785)
  1e2dc787 chore: update trials-sample endpoint to support a single trial experiments [DET-4688] (1791)
  4dbf58d6 chore: return consistent log IDs [DET-4789] (1775)
  4c1be3a4 docs: add docs for elasticsearch-backed trial log features (1768)
  bb0ab5e9 chore: deal gracefully with missing values in trials-sample [DET-4771] (1787)
  74868c6f chore: tune Fluent Bit logging performance [DET-4714] (1643)
  552bc1fc docs: add instructions to get started with Determined locally to README and docs (1747)
  31c53ada style: update WebUI style (1767)
  4f608590 ci: build React in development mode for E2E tests (1789)
  78565332 fix: learning curve tuning [DET-4375, DET-4692, DET-4711] (1753)
  4c8af11e chore: remove IDE Setup how-to doc [DET-4769] (1786)
  e5928d9c test: add master logs test [DET-4684] (1780)
  4aab166c chore: Support changing default images in det-deploy on cloud [DET-4689] (1729)
  fd080e5b docs: edit YAML topic guide (1734)
  f7c99cef chore: put index template in integrations to avoid precision related race (1776)
  9005f14a fix: webui responsive table showing unneeded horizontal scrollbar [DET-4710] (1760)
  adf587df chore: resolve security packages [DET-4733] (1773)
  0bb5e51d refactor: update tests to be more reliable (1779)
  33e280c4 ci: remove Cypress WebUI tests [DET-4580, DET-4755] (1777)
  0cba2c50 docs: update task configurations [DET-4731] (1762)
  000a4ab6 ci: increase resource class for packaging steps (1774)
  3f82a2c0 feat: add hgi basic card view and slots bar [DET-4635 DET-4632] (1717)
  000293a6 fix: relax type expectations for hparam values [DET-4742 DET-4744] (1763)
  dff17aef feat: json-schema for experiment config validation (1715)
  6aaed778 fix: workaround boto3+minio bug (1770)
  805b29f9 docs: fix incorrect field name in docker registry creds docs (1769)
  f6107db7 fix: add endTime to metric workloads [DET-4743] (1772)
  b9ecc47d chore: fix up Go dependencies (1766)
  08a3e736 fix: use cookie token from sso if applicable (1765)
  c63891a9 chore: show more information if Fluent Bit exits (1764)
  7e170c02 chore: switch library used for connecting to PostgreSQL [DET-4592] (1761)
  97b929b1 chore: update caniuse dev package (1756)
  0015ebb0 chore: refactor & migrate experiment details and trials [DET-4020] (1730)
  ed557584 fix: check for a cookie token and verify auth with it [DET-4732] (1758)
  b86afac9 Release notes for 0.13.11 (1759)
  b5b41d22 fix: add boolean to accepted hparam types [DET-4727] (1757)
  a1fe795b fix: don't request GPUs for Fluent Bit container (1755)
  074eae5d feat: add searcher-specific InvalidHP logic [DET-4334, DET-4335] (1698)
  
  
  
  Docker images
  
  - `docker pull determinedai/determined-master:0.13.13`
  - `docker pull determinedai/determined-master:d352a9ba`
  - `docker pull determinedai/determined-master:d352a9ba0d291a75263d10218b70b88132e78678`
  - `docker pull determinedai/determined-dev:determined-master-d352a9ba`
  - `docker pull determinedai/determined-dev:determined-master-d352a9ba0d291a75263d10218b70b88132e78678`

0.13.12 not secure


        

0.13.12rc2 not secure

690c7252 docs: Minor grammatical change for 0.13.12 release notes.

0.13.12rc1 not secure

dcc59499 docs: Release notes for 0.13.12.

0.13.12rc0 not secure


        

0.13.12.dev0

a19eb5f7 fix: relax type expectations for hparam values [DET-4742 DET-4744] (1763)
  798fa4fe fix: add endTime to metric workloads [DET-4743] (1772)
  783c810d fix: use cookie token from sso if applicable (1765)
  
  
  
  Docker images
  
  - `docker pull determinedai/determined-master:0.13.12`
  - `docker pull determinedai/determined-master:6f6280e7`
  - `docker pull determinedai/determined-master:6f6280e7807996eebe64df4e3503d1b08fc63c57`
  - `docker pull determinedai/determined-dev:determined-master-6f6280e7`
  - `docker pull determinedai/determined-dev:determined-master-6f6280e7807996eebe64df4e3503d1b08fc63c57`

0.13.11 not secure


        

0.13.11rc6 not secure

8fdc4bc8 Revert "chore: add priority scheduling"

0.13.11rc5 not secure

03e64067 chore: add priority scheduling

0.13.11rc4 not secure


        

0.13.11rc3 not secure

de6b50d2 fix: check for a cookie token and verify auth with it [DET-4732] (1758)
  a743bf4a fix: add boolean to accepted hparam types [DET-4727] (1757)
  75cec074 Release notes for 0.13.11 (1759)

0.13.11rc2 not secure

8098ee41 fix: don't request GPUs for Fluent Bit container (1755)

0.13.11rc1 not secure


        

0.13.11rc0 not secure


        

0.13.11.dev0

31bb6608 chore: lock api state for backward compatibility check
  2242c2c0 chore: add F1 score example of pytorch custom reducers [DET-4724] (1752)
  b9a94084 chore: hack tensorboard support to include custom metrics (1750)
  ddf8ee9b feat: support custom reducers for PyTorch (1647)
  c7fdddfe feat: support agent label for "det-deploy local agent-up" [DET-4713] (1748)
  0e240a4c feat: learning curve [DET-4445] (1731)
  6a9e7313 style: sort interface keys and type literals (1699)
  1c818d83 test: fix race condition in test-intg-agent (1741)
  bab2337b chore: fix mnist_data_layer convergence test (1744)
  ef59afa4 ci: split CircleCI tests more carefully (1740)
  19474086 feat: enable configuring trial logs backend on Kubernetes (1737)
  f8b89bc8 ci: upgrade version of CircleCI Helm orb (1739)
  d1ece88f feat: rebase onto horovod 0.21.0 [DET-4668] (1720)
  883bc5df fix: command priority not respected [DET-4674] (1735)
  3f87b246 test: use updated GKE version (1736)
  c68b32bd docs: add topic guide for priority scheduler [DET-4670] (1703)
  15794fa0 chore: log through Fluent Bit on Kubernetes [DET-4622] (1712)
  bc523134 chore: migrate getInfo endpoint [DET-4406] (1713)
  8ccb40db fix: webui table horizontal scroll [DET-4660] (1722)
  f991a042 feat: support multiple backward call per train_batch in pytorch [DET-4667] (1732)
  ecc78539 build: allow parallel runs of js and css checks (1727)
  f5e58c37 chore: update to Go 1.15 (1716)
  ac52ec87 fix: fix asha max concurrent trials (1719)
  eb76a2c7 fix: fix missing field in function call from bad merge (1726)
  73799c52 test: integration test fluent with postgres and elasticsearch backends (1705)
  1a1f756e fix: accept None-type hyperparameters with --local (1704)
  01cde0f6 chore: webui MultiSelect storybook [DET-4040] (1714)
  5cc385ac docs: clarify some Kubernetes-related docs (1708)
  374e7477 fix: BERT SQuAD example works with latest stable transformers [DET-4680] (1718)
  8aa6cd36 feat: support order by in trial logs api [DET-4647] (1706)
  96ca8624 chore: update command APIs to return resource pool info [DET-4568 DET-4569 DET-4570 DET-4571] (1710)
  d71dfc4f style: fix mobile steps table [DET-4669] (1701)
  491b0629 chore: provide telemetry information in new api [DET-4642] (1672)
  2faade7b chore: priority scheduler unit tests [DET-4513] (1658)
  cc491f3c chore: migrate trial details endpoint [DET-4021] (1674)
  b3d70e22 chore: set MinCapacity for RDS for secure det-deploy to 2 (1535)
  7127d26a chore: Updates to 0.13.10.dev0. (1702)
  aa1389ef feat: hp viz skeleton [DET-4494, DET-4495, DET-4545] (1618)
  8b35c38c test: integration test elastic-backend trial logs APIs (1675)
  d5a913db chore: restart fluentbit on failures [DET-4665] (1696)
  194af457 chore: add resource pools mock api [DET-4639] (1662)
  b1c29306 chore: add dev hgi cluster page and stat overview [DET-4633] (1676)
  7ed21d40 style: correct mobile viewport [DET-4664] (1695)
  d9add138 feat: port of DETR (1470)
  39b6d12f chore: minor copy update to trial log datetime filters (1692)
  f0817a85 chore: hide the trial logs filters when there are no filter options (1690)
  f013ea35 chore: camelcase required api attr names [DET-4648]  (1681)
  c4c66943 chore: fix a shadow var declaration (1691)
  4b0cf95e fix: prevent mobile tabbar from opening new window for Master Logs [DET-4654] (1688)
  72cbe097 docs: document setting priorities in experiment config (1687)
  919d9cfb chore: sort trial log's filter options (1686)
  a29ccd63 fix: add abort controller to trial log endpoints (1689)
  db1b7ca7 chore: move dev dependencies out of dependencies (1679)
  5c69af38 fix: add K8s disclaimer for mmdetection example (1683)
  27a67b69 ci: fix windows cli test (1685)
  
  
  
  Docker images
  
  - `docker pull determinedai/determined-master:0.13.11`
  - `docker pull determinedai/determined-master:16860f3f`
  - `docker pull determinedai/determined-master:16860f3fd2495af53913a9a62d7330898004b671`
  - `docker pull determinedai/determined-dev:determined-master-16860f3f`
  - `docker pull determinedai/determined-dev:determined-master-16860f3fd2495af53913a9a62d7330898004b671`

0.13.10 not secure

898aa79d chore: lock api state for backward compatibility check

0.13.10rc6 not secure

6e6fa1cf chore: restart fluentbit on failures [DET-4665] (1696)
  df006946 style: correct mobile viewport [DET-4664] (1695)

0.13.10rc5 not secure

9bc6a9bb chore: fix a shadow var declaration (1691)

0.13.10rc4 not secure

434ed716 chore: hide the trial logs filters when there are no filter options (1690)
  ba1ac13e fix: prevent mobile tabbar from opening new window for Master Logs [DET-4654] (1688)
  fe052e42 chore: sort trial log's filter options (1686)
  a687ec57 docs: document setting priorities in experiment config (1687)

0.13.10rc3 not secure

554706dc fix: add abort controller to trial log endpoints (1689)

0.13.10rc2 not secure

9f67d2c9 ci: fix windows cli test (1685)

0.13.10rc1 not secure

099bfe44 fix: add K8s disclaimer for mmdetection example (1683)

0.13.10rc0 not secure


        

0.13.10.dev0

99a848fe feat: support configuring priority scheduler in det-deploy [DET-4508] (1682)
  90548653 feat: read logs from elasticsearch [DET-4621] (1637)
  7da12666 fix: let shells work through an HTTP proxy/load balancer [DET-4469] (1677)
  c4b2df06 feat: expose some Fluent Bit logs in the agent (1680)
  7106bef2 fix: flush searcher events more often (1673) [DET-4644]
  cc351eff fix: template merging for bind mounts [DET-4630] (1678)
  63e13a4e feat: webui trial logs improvements [DET-4228, DET-4480, DET-4481, DET-4482, DET-4483, DET-4594] (1650)
  f4a3afd4 feat: display task priorities in CLI [DET-4515, DET-4516] (1639)
  7ed3d9c9 feat: make priority scheduler configurable and add docs [DET-4641] (1667)
  33a1885f fix: limit concurrent restores to avoid resource exhaustion [DET-4556] (1666)
  a2a807e9 feat: allow task requests to receive a resource pool field [DET-4342] (1600)
  912c3756 chore: Improvements to metrics streaming endpoints [DET-4532] (1664)
  85fee6a2 docs: update k8s limitations [DET-4582] (1671)
  6bc6bd69 docs: reorganize the documentation contents (1663)
  155a08bc fix: allow dots in map config keys (1665)
  64550ef9 chore: Work around possible bug when PyTorch opens checkpoint files [DET-4614] (1661)
  3c276f09 ci: add taiko video plugin (1632)
  f1dbf701 chore: migrate fork/continue experiment endpoint [DET-4023] (1659)
  ab6bdfb4 test: integration test trial logs API (1456)
  6e0dae15 chore: Ensuring examples always call .contiguous() before .view() [DET-4613] (1653)
  d05e222b feat: priority scheduler with preemption [DET-4512] (1634)
  2e5f4afb8 chore: migrate wait page to react [DET-4521] (1559)
  7aecb058 test: add responsive navbar test [DET-4603] (1656)
  4ce46236 test: avoid swallowing errors in taiko (1657)
  466a2264 test: reduce state sharing between e2e tests [DET-4487] (1543)
  764e969e build: set server address for react preview build. (1654)
  d7fb07bf feat: allow default password in kubernetes [DET-4435] (1624)
  80187b9b feat: support elasticsearch as a trial logging backend [DET-4179] (1542)
  0929c8a7 refactor: update filters to dynamically collapse when needed [DET-4584] (1629)
  8f011f36 fix: don't replay duplicates in master restart [DET-4525, DET-4599] (1655)
  24428f63 feat: add telemetry for schedulers [DET-4517] (1645)
  ef4e9aef chore: extend on complete hook for overflow actions (1651)
  93007ed2 chore: webui migrate ActiveExperiemnt context polling to new API [DET-4068] (1560)
  d94824e6 refactor: add abort api calls [DET-4585] (1630)
  e7321ee8 docs: fix rest-api side menu links [DET-4598] (1640)
  03228072 ci: run some E2E AWS CI tests with the master using TLS [DET-4606] (1529)
  78ec837f feat: support validation_steps in configure_fit() [DET-4529] (1649)
  68b7b081 feat: support max zero slot containers for resource pools [DET-4309,DET-4340] (1507)
  7cbd5dad docs: fixes for EKS docs. (1636)
  dd89200d fix: limit --local --test mode to 1 gpu [DET-4602] (1648)
  f0eba7d4 chore: lock api state for backward compatibility check (1644)
  c26de486 chore: bump version: 0.13.8.dev0 -> 0.13.9.dev0 (1641)
  1ad40425 chore: fix fluent lua filter (1642)
  00d38b91 fix: fix wait page url for notebook launch request [DET-4586] (1631)
  fe901d56 fix: track best validation metric for darts_cnn hp search benchmark (1638)
  1d58515f feat: add documentation for AWS custom tags (1621)
  4dcea3e3 chore: Checking for more instances of EagerTensors [DET-4566] (1633)
  a77657d0 fix: unets_tf_keras example data download (1635)
  1210b771 feat: return resource_pool in agent GET APIs [DET-4567] (1616)
  422b900a chore: Adding NVIDIA Tesla A100 details for GCE [DET-4583] (1628)
  e3c240af style: responsive webui [DET-4417, DET-4420] (1501)
  2166ad79 test: fix CircleCI timing-based splitting for E2E tests (1622)
  be1ed46a chore: fix keras validation for dtrain (1626)
  8384e5c3 chore: increase timeout for metrics stream tests that are flaky on CI infra [DET-4581] (1627)
  fb86450f ci: add gauge taiko [DET-4576] (1619)
  4855079d feat: Add custom tags to AWSClusterConfig (1561)
  2e2c121c fix: clean up data handling in TFKerasTrial (1564)
  74d41962 feat: support per command shmSize settings [DET-4577] (1620)
  46017942 fix: tensorboard to load from experiment list via table batch (1617)
  4e7e7383 ci: update task and authentication tests to be more reliable (1614)
  b323c24a Fix typo (1611)
  bff6f277 feat: ALBERT on SQuAD 2.0 example (1609)
  d63897fd chore: include offset in trial log IDs returned to webui [DET-4561] (1608)
  e1e514a5 fix: webui unarchive button loading state [DET-4017] (1594)
  7509e1ae chore: webui migrate killTask to new API [DET-4019] (1589)
  8d1eb2d6 docs: Release notes for 0.13.8. (1603)
  4fbb6e87 fix: honor DET_MASTER_CERT_NAME with shells (1604)
  43f4901d chore: update CODEOWNERS to be opt-in (1595)
  eb45feed chore: add endpoint to stream trial log fields [DET-4479] (1537)
  769a5d83 chore: bumpenvs (1599)
  55c6ccb7 fix: allow clients to override the expected master cert address [DET-4547] (1588)
  da38010b feat: change the priority scheduler to round robin scheduler [DET-4514] (1596)
  6a37a5f9 chore: bumpenvs for NCCL update (1593)
  ec9abc31 feat: implement API endpoint for sampling streams of metrics from the best trials [DET-4441] (1571)
  d8f536ee feat: InvalidHP Searcher Ability [DET-4333] (1550)
  8b6ab794 feat: propagate task priorities to resource pools [DET-4510] (1577)
  e8a6784e chore: Tag metrics streaming APIs as Internal due to their less-stable or less-supported status [DET-4546] (1592)
  175f2125 fix: issue data type warnings in to_device (1591)
  5e9b6d38 fix: tensorboard with absolute storage_path (1590)
  aedab511 chore: webui migrate getAgents to new API [DET-3844] (1576)
  1bb19906 feat: add configurations for priority scheduler [DET-4507, DET-4509] (1565)
  
  
  
  Docker images
  
  - `docker pull determinedai/determined-master:0.13.10`
  - `docker pull determinedai/determined-master:93b93697`
  - `docker pull determinedai/determined-master:93b9369791d9278be68e720ab65c5328d3fed5b9`
  - `docker pull determinedai/determined-dev:determined-master-93b93697`
  - `docker pull determinedai/determined-dev:determined-master-93b9369791d9278be68e720ab65c5328d3fed5b9`

0.13.9 not secure

149a461b docs: Release notes for 0.13.9 (1623)

0.13.9rc4 not secure


        

0.13.9rc3

bb8be819 docs: More changes for release notes for 0.13.9.
  cbcc7b1f feat: support per command shmSize settings [DET-4577] (1620)

0.13.9rc2 not secure

d7b20a31 fix: tensorboard to load from experiment list via table batch (1617)

0.13.9rc1 not secure


        

0.13.9rc0 not secure

fa24de5e docs: Release notes for 0.13.9.
  c96d738c chore: include offset in trial log IDs returned to webui [DET-4561] (1608)

0.13.9.dev0

5264be30 docs: Release notes for 0.13.8. (1603)
  
  
  
  Docker images
  
  - `docker pull determinedai/determined-master:0.13.9`
  - `docker pull determinedai/determined-master:3f0ec0ce`
  - `docker pull determinedai/determined-master:3f0ec0ce8d9dbe4256a630156480661f3a8c2ff1`
  - `docker pull determinedai/determined-dev:determined-master-3f0ec0ce`
  - `docker pull determinedai/determined-dev:determined-master-3f0ec0ce8d9dbe4256a630156480661f3a8c2ff1`

0.13.8 not secure


        

0.13.8rc4 not secure


        

0.13.8rc3


        

0.13.8rc2

b9c36f18 fix: honor DET_MASTER_CERT_NAME with shells (1604)

0.13.8rc1 not secure

27439a53 chore: bumpenvs (1599)
  9aedc579 fix: allow clients to override the expected master cert address [DET-4547] (1588)
  66fa2ce2 chore: bumpenvs for NCCL update (1593)
  0bb6bcb0 chore: Tag metrics streaming APIs as Internal due to their less-stable or less-supported status [DET-4546] (1592)

0.13.8rc0 not secure

9f8f3067 chore: fix old checkpoint export apis [DET-4538] (1582)
  9b5dcb48 feat: support trial log filtering in CLI [DET-4489] (1429)
  38c2c8fb docs: Document which network ports PEDL uses for inter-agent communic… (1464)
  ac1a57c4 chore: webui migrate patchExperiment to new API [DET-4017] (1557)
  69c3ae7c docs: add rest api to the references toc [DET-4381] (1586)
  33aed968 fix: fix sign in page link to docs (1587)
  be27ffe5 fix: restore keras TensorBoard wrapper (1580)
  c0a36031 chore: unets_tf_keras README note (1581)
  b1c8c1cb fix: check for error when converting k8s watch objects (1572)
  d18b187d chore: add current page to trial logs breadcrumb [DET-4530] (1569)
  6442e38e chore: server-side hashing for password change [DET-4534] (1574)
  743ed587 fix: fix wait page asset paths [DET-4496 DET-4497] (1551)
  3e387e0d test: use synthetic data for gpu detection test (1573)
  a850b043 fix: avoid hangs during validation for TFKeras [DET-4434] (1555)
  71c98458 fix: fix experiment archive action going out of sync [DET-4535] (1575)
  899913e2 chore: add a link to release notes in update notification [DET-4097] (1552)
  95d257d4 fix: fix ctr-click failing to open experiment rows [DET-4531] (1570)
  911355a6 feat: support configure_fit() in TFKerasTrial (1566)
  43a18a77 docs: clean up API docs [DET-4399] (1568)
  d6ea9f1a feat: add streaming endpoint for trial metrics at a specific point [DET-4440] (1562)
  e8a5a149 feat: add navigation sidebar and breadcrumbs to log views [DET-4394] (1546)
  2b96d770 fix: use agent's master location overrides for Fluent Bit (1567)
  df516839 chore: fix reporting for verbose=1 (1563)
  5b733313 feat: update trial route [DET-4402] (1549)
  bcc0ea8f add a test to disable and enable slots. (1548)
  ce2db3ee fix: webui label select misalignment [DET-4416] (1554)
  0dba3de4 feat: add streaming endpoints with metric metadata for future UI work [DET-4439] (1538)
  765275a0 feat: add support for TFKeras models that subclass tf.keras.Model [DET-4393, DET-4103, DET-3257, DET-3217] (1495)
  5bdc0e33 build: add a pre-release script [DET-4414] (1505)
  85d6e68c chore: correct the percentage reporting for verbose=1 (1558)
  9567e8e3 fix: make trial actor still handle `ContainerLog` messages (1553)
  264c8fb9 feat: use Fluent Bit for trial logs [DET-4178] (1462)
  108a0e1d fix: fix disabling slots [DET-4492] (1547)
  47f35726 fix: fix not being able to find resource pool [DET-4477] (1544)
  7f57be6b fix: keras callbacks [DET-4299] [DET-4202] (1458)
  e5aeb589 style: add alert icon [DET-4430] (1532)
  58ba70fa feat: make slot type configurable. [DET-4308] (1484)
  3e23aed6 chore: remove protonet_omniglot_pytorch nightly test (1541)
  e4a13764 test: avoid clearing auth token in e2e tests [DET-4486] (1539)
  14ebb553 feat: portable webui builds and pr preview [DET-4324] (1422)
  a152663a chore: bump lib pq (1533)
  fa2d5921 chore: lower threshold for some convergence tests (1530)
  5566a3b0 chore: webui make page/tab title more descriptive [DET-2151] (1486)
  4d66329d fix: make pip happy again (1534)
  6958dc6c feat: support filtering in trial logs api [DET-4177] (1427)
  9f7ed8f9 chore: fix master and agent release targets (1531)
  590d2dda chore: webui migrate killExperiment to new API [DET-4016] (1521)
  6c7fdddf feat: allow tensorboard to run startup hooks [DET-4187] (1463)
  1a979cc8 feat: allow det-deploy aws to specify subnet for simple deployment type (1515)

0.13.8.dev0

d655dad1 docs: Release notes for 0.13.7. (1526)
  2bab88a4 chore: add new fields to trial logs [DET-4176] (1373)
  7fb09b49 docs: add FAQ about TF2 (1523)
  279c5c6f chore: make gen-attributions.py play nicer (1525)
  37fc88fa refactor: clean up code for master-sent messages (1520)
  d06dbea7 feat: make DB service type configurable in Helm chart (1522)
  c44c9f73 chore: fix e2e convergence tests (1524)
  80b47b51 docs: Add TLS disclaimer (1519)
  4d4b1e8a fix: saving & restoring RNG state for Keras & Estimators [DET-3743] (1492)
  711cb69b fix: add missing actor startup for create experiment (1517)
  8d246c63 test: add api pagination refactor and test [DET-4425] (1504)
  09a07f4d chore: sync agent go checksum file (1516)
  4c84784d fix: avoid showing tooltip when hovering outside of the nav bar [DET-4284 (1461)
  5c74a6c9 fix: incorrect directory path for mmdetection tests (1514)
  
  
  
  Docker images
  
  - `docker pull determinedai/determined-master:0.13.8`
  - `docker pull determinedai/determined-master:e1295346`
  - `docker pull determinedai/determined-master:e12953467c9de3d9289c8aca882d0993d89c23dd`
  - `docker pull determinedai/determined-dev:determined-master-e1295346`
  - `docker pull determinedai/determined-dev:determined-master-e12953467c9de3d9289c8aca882d0993d89c23dd`

0.13.7 not secure

193122db chore: fix master and agent release targets (1531)
  87cf2861 chore: sync agent go checksum file (1516)
  678a0ec5 docs: Release notes for 0.13.7. (1526)

0.13.7rc5 not secure

3eb54421 chore: make gen-attributions.py play nicer (1525)

0.13.7rc4 not secure

411e682f docs: Add TLS disclaimer (1519)
  19a113d2 fix: add missing actor startup for create experiment (1517)

0.13.7rc3 not secure


        

0.13.7rc2


        

0.13.7rc1


        

0.13.7rc0 not secure

fb509776 fix: webui experiment list goes to first page when changing filters [DET-4377] (1508)
  bce4fb6a fix: update postgres and perl version [DET-4395] (1491)
  137b806a fix: fix to allow det-deploy to upgrade existing clusters [DET-4427] (1511)
  f81fbe09 feat: make spot instances available in det-deploy [DET-4339] (1494)
  c03afa4f docs: spot instances docs and fixes [DET-4338] (1487)
  c3307db1 fix: disable deletion protection on Cloud SQL instances [DET-4428] (1512)
  a5ddcae0 test: disable cypress wait check [DET-4423] (1503)
  bee5c7aa fix branch name cannot contain slash for ci (1506)
  a02f1964 feat: add cluster name to master config [DET-3953] (1474)
  345c630c docs: minor updates to k8s docs (1509)
  11fce709 chore: autogenerate attributions files [DET-4433] (1493)
  8ecebed3 docs: remove outdated k8 limitations (1475)
  9bed2d58 test: add unit tests for agent resource manager [DET-4134] (1477)
  9b7a9a73 feat: support mmdetection library in Determined (1438)
  6e7a77d2 chore: remove protonet_omniglot_pytorch from nightly (1510)
  07ac1858 feat: improved metric selector [DET-4122] (1421)
  e7802dbe fix: fix api pagination index out of bound error (1499)
  87aa6fd6 docs: deprecate SHA notice [DET-4336] (1500)
  03e0a92f chore: basic create experiment endpoint [DET-4386] (1455)
  a1fe3dd2 chore: propagate horovod worker process retcode (1496)
  254766e4 docs: add remove steps topic guide [DET-3876] (1485)
  d13ec696 chore: fixing Huggingface example and nightly tests (1497)
  5e366ffe chore: webui stop polling on experiment terminal state [DET-4305] (1467)
  335e8a46 fix: webui remove checkpoint button for deleted checkpoint [DET-4286] (1466)
  8baef3d7 chore: make nightly tests parallel (1490)
  74f2c94e chore: split /master endpoint [DET-4408] (1483)
  6f32498d update time for convergence tests (1489)
  2a7ba138 chore: migrate webui get current user to new api [DET-4015] (1459)
  1cccde18 chore: fix nightly test failures [DET-4397] (1479)
  7711ef9d fix: fix handling of boolean hyperparameters in trial view [DET-4412] (1480)
  bd229fbe fix: preserve rank_id in logs [DET-4413] (1468)
  00939543 chore: update buf image (1478)
  04feba59 feat: add routing logic for multiple resource pools [DET-4132, DET-4302] (1398)
  64a0f8d3 feat: add shell, command, and notebook launch APIs [DET-4094] (1454)
  f090caf7 fix: fix setting default for fit field in master.yaml (1469)
  4c11398d fix: fix a parameter typo in ci (1473)
  c7e7e421 fix: fix missing swagger definitions [DET-4213] (1437)
  22cdc82e ci: show python environment via pip freeze (1472)
  29f62843 feat: webui add labels filter for experiments [DET-4117] (1465)
  f188a1d1 chore: webui improve float values vizualization [DET-4127] (1432)
  4832e4ca chore: set enable_cors for test aws cluster [DET-4326] (1439)
  3668ea5b chore: add tests for cheap examples (1430)
  17c64dad fix: rst formatting issue (1460)
  2630f748 docs: update CONTRIBUTING.md to point to new locations of examples [D… (1457)
  de2f55d1 fix: specify numpy version in requirements.txt (1408)
  b5399b2a feat: support AWS spot instances [DET-4191] (1415)
  c97dc523 fix: improve performance of agents endpoint for k8s [DET-4073] (1450)
  4e89b909 fix: set up swagger static deploy (1449)
  a6d5c3ce build: add buf breaking change detection (1442)
  3c233088 fix: change eks cluster setup docs to pass fmt check (1453)
  1a0954ab chore: react dependency update [DET-4379] (1447)
  267fca5e chore: add common launch params to tensorboard API [DET-4214] (1436)
  0c83af31 chore: add EKS cluster setup documentation [DET-4028] (1425)
  2f2d38ec style: add enforcement of reST formatting (1399)
  0a9c33ae fix: show tensorboard sources for CLI deployed tensorboards [DET-4372] (1441)
  321b263f fix: style fix for long checkpoint names and minor copy change for task batch modal confirmation [DET-4376] (1446)
  05634df5 fix: instruct protoc to generate camelCase names. (1448)
  f8440c1c fix: update task names for new trials [DET-4832] (1451)

0.13.7.dev0

3e0e299c docs: Clear out release note candidates.
  b945a275 docs: release notes for 0.13.6. (1444)
  415acf81 fix: update model registry link to REST API docs (1445)
  2cfc1662 fix: show more help text and version info in det-deploy [DET-4373] (1443)
  9d9f2c29 chore: remove useless support_determined_native calls (1440)
  3763bbd4 chore: remove tests for native parallel (1435)
  
  
  
  Docker images
  
  - `docker pull determinedai/determined-master:0.13.7`
  - `docker pull determinedai/determined-master:18ce58d8`
  - `docker pull determinedai/determined-master:18ce58d82ddeff7f03bb47ac43e8635aad7a691c`
  - `docker pull determinedai/determined-dev:determined-master-18ce58d8`
  - `docker pull determinedai/determined-dev:determined-master-18ce58d82ddeff7f03bb47ac43e8635aad7a691c`

0.13.6 not secure


        

0.13.6rc1 not secure

7b3ade0e docs: Clear out release note candidates.
  186e3486 fix: show more help text and version info in det-deploy [DET-4373] (1443)
  4b1cdd0d docs: release notes for 0.13.6. (1444)

0.13.6rc0 not secure

68bdba57 fix: correct `humanReadableFloat` error on Experiment Detail page [DET-4354] (1431)
  de4ec0ae feat: add opentracing to actor system [DET-4212] (1327)
  c9bb9b29 docs: add docs for TLS usage and configuration [DET-4364] (1419)
  f6e36fec feat: support storageClass configuration in Helm Chart [DET-4357] (1434)
  5f34b589 fix: webui metric chart not displaying log scale properly [DET-4246] (1418)
  588cb70e chore: add telemetry for k8s vs. agents [DET-4234] (1411)
  689b06f9 chore: update horovod version  (1413)
  e145d89f make AWS and GCP agent image optional (1417)
  c608fd12 ci: update gke version (1424)
  f0289b52 fix: webui preserve colors on metrics chart [DET-4247] (1400)
  70d3ae13 fix: make webui chart legend transparent to show data behind [DET-4218] (1420)
  a6cb9be6 docs: update to new version of custom Sphinx theme (1414)
  153131f4 fix: don't fail master if restoring non-terminal exp from DB [DET-4074] (1397)
  69afa950 chore: webui add experiment label editing on experiment detail page [DET-3972] (1356)
  e23d1130 chore: make `make -C proto build` idempotent (1390)
  bec697f3 fix: support for --local-state-path with det-deploy gcp [DET-4277] (1402)
  eb077439 fix doc for agent starting period and idle timeout (1416)
  bfcb8748 feat: support configuring CPU and Mem reqs for DB in helm chart [DET-4032] (1412)
  2564555a chore: handle failed build in update bumpenvs script (1395)
  92de5cd7 chore: remove mixed mode TLS workaround from the agent (1410)
  dbce5070 fix: always load default system TLS certificates in the harness (1409)
  1dd949f2 chore: add a formatter for protobufs (1405)
  2bd7e3aa fix: webui trial info checkpoint size label update [DET-4250] (1401)
  a867df30 fix: use the correct target cancel state for canceling experiment [DET-4257] (1404)
  ed51d5ad docs: bundle static swagger-ui with docs [DET-4210] (1376)
  53d974ea chore: fix dependency issue for windows tests (1406)
  6c9e9413 chore: add swagger authentication spec [DET-4272] (1396)
  4450b45c chore: include workloads in trial endpoint [DET-4036] (1342)
  8c8037f6 chore: add post experiment swagger spec (1363)
  46643b2a chore: rebase onto horovod 0.20.0 [DET-4225] (1388)

0.13.6.dev0

3361c3b2 docs: get rid of staged release notes for 0.13.5, in preparation for the next release.
  876833d0 docs: Release notes for 0.13.5. (1392)
  6701cd7c chore: set default startup in det-deploy to 20m (1394)
  c20deab9 chore: bump tf test versions (1378)
  5bf7eea4 chore: introduce resource pool and resource manager [DET-4131,DET-4136] (1365)
  c767f023 ci: update from deprecated remote docker versions [DET-4262] (1393)
  89a59063 fix: update agents context polling to block before next poll [DET-4264] (1385)
  5d2d4bc4 fix: det-deploy deprovisions GCP agents despite long master names [DET-4271] (1391)
  198d64eb fix: experiment chart legend labelling line as 'trace 0' (1389)
  19262505 feat: increase max disconnected and idle period [DET-4267] (1386)
  38b0c950 fix: commands (TensorBoards, notebooks, etc.) should not be preempted [DET-4157] (1346)
  20a7bc7a docs: fix broken links (1387)
  68f3568e docs: add a tf.layers-in-Estimator example (1383)
  4d0ba2f9 feat: don't log through agent 0 [DET-4180] (1344)
  f1ff54ec chore: fix typo in a docstring (1384)
  915fb50f fix: update percent utility to handle out of range numbers (1381)
  c8ee63e1 chore: fix possible syntax error when parsing experiment labels request [DET-4265] (1382)
  346bcc43 chore: fix typo in helm chart (1379)
  8d64cf9d fix: don't accept stale socket connections [DET-4203] (1367)
  5f4c4901 fix: webui tweak select for better layout [DET-4123] (1351)
  693098e1 fix: webui trial chart render metrics with same name [DET-4169] (1350)
  
  
  
  Docker images
  
  - `docker pull determinedai/determined-master:0.13.6`
  - `docker pull determinedai/determined-master:35cd77a2`
  - `docker pull determinedai/determined-master:35cd77a202dfe084a5c9655e8291f14c1a1c14a8`
  - `docker pull determinedai/determined-dev:determined-master-35cd77a2`
  - `docker pull determinedai/determined-dev:determined-master-35cd77a202dfe084a5c9655e8291f14c1a1c14a8`

0.13.5 not secure

e4c378f4 docs: Release notes for 0.13.5. (1392)

0.13.5rc2 not secure

17bbfb3e fix: update agents context polling to block before next poll [DET-4264] (1385)
  60cc59c1 fix: det-deploy deprovisions GCP agents despite long master names [DET-4271] (1391)
  dda3a612 fix: experiment chart legend labelling line as 'trace 0' (1389)
  8b263ca9 feat: increase max disconnected and idle period [DET-4267] (1386)
  681e3f55 docs: fix broken links (1387)
  d2963e93 docs: add a tf.layers-in-Estimator example (1383)

0.13.5rc1 not secure

44729d65 fix: update percent utility to handle out of range numbers (1381)
  864a03a1 chore: fix possible syntax error when parsing experiment labels request [DET-4265] (1382)
  12e5da5d chore: fix typo in helm chart (1379)
  9ab8c693 fix: don't accept stale socket connections [DET-4203] (1367)
  504936c8 fix: webui tweak select for better layout [DET-4123] (1351)
  5d2268fb fix: webui trial chart render metrics with same name [DET-4169] (1350)

0.13.5rc0 not secure

9cb10821 chore: bumpenvs (1377)
  b3dcd3ff feat: support new AWS regions [DET-3837] (1297)
  6ea89771 chore: replicate client side hash server side (1372)
  22cb5ee0 chore: update webui test tools (1368)
  2e8ba3e3 fix: send credentials for streaming endpoints in dev environment [DET-4240] (1359)
  365359ad chore: revert old login [DET-4242] (1370)
  3edc1103 feat: allow yogadl to connect to the master over TLS (1369)
  8df0554f chore: unit tests for kubernetes pod actor [DET-4107] (1353)
  4fa71b01 fix: check k8 message length before parsing [DET-4241] (1360)
  8a28e82e docs: specify that helm 3 is required (1371)
  cac219fd test: fix broken k8 test (1375)
  06a342de feat: support initContainers and sidecar containers for k8s [DET-4026] (1335)
  df799c67 fix: update test cluster port config for e2e-tests (1366)
  cf9b7178 chore: simplify provisioner and scheduler protocol (1352)
  55143288 chore: make Go tests and Python installs quieter (1364)
  cc420450 fix: update model registry to new api (1361)
  68e75c10 chore: Reduce default RDS capacity and enable auto pause. [DET-4220] [DET-4219] (1341)
  71affddd chore: add /api/v1 alias for create experiment endpoint [DET-4162] (1321)
  3834b2a4 chore: print error for non-release helm install (1349)
  a4269b3e chore: add basic create tensorboard endpoint [DET-4095] (1331)
  a9153884 test: update nightly test paths (1355)
  5ace732e chore: fixes for example configuration files (1358)
  2e429f30 fix: fail correctly on preclose checkpoint failures [DET-4189] (1323)
  41b8da0e Add integration tests for det-deploy local (1326)
  bd06f35d chore: add API call to list all defined labels for experiments [DET-4099] (1337)
  015903fc feat: restructure examples and add READMEs (1301)
  b09d4362 chore: allow dying workers to exit in peace (1347)
  321b8562 chore: quiet grpc log levels down (1343)
  35e8100a feat: expose _TrainContext.from_config() (1336)
  4f51c989 feat: introduce logStreamActor & notebook logs endpoint (1248)
  e669ab49 chore: format docs rst files (1345)
  9d3525cb chore: remove tp tests (1334)
  6b978d32 chore: remove zmq patches in keras [DET-2708] (1340)
  43209668 remove maxSlotsPerPod hardcoding from values.yaml (1339)
  bf20bf0f docs: ensure canonical URLs are set for generated doc pages (1338)
  9bae553e chore: small refactor of Helm chart (1330)
  84e67eb0 chore: provide best checkpoint, latest, and best validations for trial endpoints (1315)
  9a550f08 docs: Revise README and docs landing page (1325)
  c522b51b feat: support configuring TLS via Helm chart [DET-4030] (1310)
  c12c401b test: remove nightly tensorpack tests (1328)
  75e0fb02 feat: allow the master to do everything over just one port (1320)
  b6c4ff05 feat: allow dynamic agents to use TLS to connect to the master (1313)
  0a4dcc74 fix: don't unset allReady for trials [DET-4197] (1322)
  7ef772b4 docs: fix phrasing from Tensorpack removal (1324)
  1c8328d0 fix: Do not perform worker health checks during termination [DET-4175] (1318)
  8212f58c feat: "det-deploy gcp down" deprovisions dynamic agents [DET-4155] (1314)
  de3ee14a chore: remove support for TensorpackTrial [DET-4181] (1319)
  49e60d06 chore: remove dependency on React Plotly [DET-3724] (1287)
  7f1f13aa chore: add helper to register functions as actors (1302)
  36a0bcf4 refactor: use async polling [DET-4182] (1317)
  ff5e92df refactor: prevent rapid polling calls from making multiple polling timers [DET-4183] (1316)
  97d5c371 chore: reduce scheduler complexity [DET-4082] (1237)
  41fd9b35 chore: move workload out of searcher (1303)
  f527f0c1 fix: fix typo in Helm chart [DET-3781] (1312)
  d2344391 chore: sunset elm [DET-3907] (1164)
  2a396800 chore: bump version: 0.13.4.dev0 -> 0.13.5.dev0 (1311)
  43af93e6 chore: create storybook stories for Navigation and UserSelectFilter components [DET-4013] [DET-4041] (1240)
  971e0d1f chore: webui react storybook for StateSelectFilter component [DET-4039] (1252)
  b02935e8 docs: Release notes for 0.13.4. (1309)
  f7ee07a5 docs: REST API docs fixes (1308)
  9046f1bb fix: remove tensorboard source column from task list [DET-4173] (1306)
  83535254 fix: update command logs response expectation [DET-4167] (1304)
  44c68ee8 feat: lazily launch TensorBoards [DET-4156] (1293)
  03e11833 fix: deselect selected rows when table batch actions are done [DET-4128] (1299)
  
  
  
  Docker images
  
  - `docker pull determinedai/determined-master:0.13.5`
  - `docker pull determinedai/determined-master:91e70159`
  - `docker pull determinedai/determined-master:91e70159db41fb52dd677db5713f6edeb12a0430`
  - `docker pull determinedai/determined-dev:determined-master-91e70159`
  - `docker pull determinedai/determined-dev:determined-master-91e70159db41fb52dd677db5713f6edeb12a0430`

Links

Releases