runc0.log
5895 lines (5895 loc) · 295 KB
logs_data/30hc/cgrew1/ablo/run_2024-08-10_20-39-00/run_1a172_00001_1_clip_discriminator=10,dr_cc=0.8000,seed=1_2024-08-10_20-39-00/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type1-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s1/08.10_20.39.49/models/ckpt_policy_T15000000.pt
0_hccustom_v0_type1-infogsdr_ppo-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s1
v_data: 0
Inferred from ckptpath name:
il_method: infogsdr
rl_method: ppo
activation: relu
hidden_size: [100, 100]
norm_obs: 0
info_loss_type: None
encode_sampling: normal
normalize_code: 0
tl_emb: 0
TRAINED C_DATA: 1
ARG C_DATA: 1
Using (100, 100) relu networks.
State dim: 17, action dim: 6, action bound 1
args.encode_dim: 2
TRAJ is loaded from /home/vsreeramdass3/code/vild_code1/imitation_data/STRAT_h5/HCCustom_v0.h5 with traj_num 25.0, data_size 25000 steps, and average return 996.96
No of trajs: Train 15 Val 3 Test 7
Total data pairs: 14985, K 15, state dim 17, action dim 6, a min -0.7615942, a_max 0.7615942
Policy model is loaded from logs_data/30hc/cgrew1/ablo/run_2024-08-10_20-39-00/run_1a172_00001_1_clip_discriminator=10,dr_cc=0.8000,seed=1_2024-08-10_20-39-00/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type1-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s1/08.10_20.39.49/models/ckpt_policy_T15000000.pt
Discr model is loaded from logs_data/30hc/cgrew1/ablo/run_2024-08-10_20-39-00/run_1a172_00001_1_clip_discriminator=10,dr_cc=0.8000,seed=1_2024-08-10_20-39-00/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type1-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s1/08.10_20.39.49/models/ckpt_discr_T15000000.pt
***** Max Ep Steps: 1000 args.seed 1 test_seed 1 *****
*** budgets = [10, 20, 30, 40, 50] NPARALLEL = 50 n_test_episodes = 5 ***
***** num zs = 1500 *****
***** args.encode_sampling = normal *****
current_vel_mean [0.97, 1.78, 2.88, 3.65, 4.87]
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
zvs shape 1500
RT 990.0781333333334 62.27448382108215 1000.0 -10.4
LR 0.9909333333333334 0.08341340153449897 1.0 -0.6
VL-std-all 0.1963888865915018 0.0
*** budget = 10
VL10-1 0.16801206981905045 0.1615379569118344
RT-VL10-1 997.3493333333334 1.2552156078627368 1000.0 994.0
VL10-2 0.17191424431709987 0.1590571645281902
RT-VL10-2 997.6 9.530750232799095 1000.0 882.0
VL10-3 0.10391029133864471 0.12059193384106831
RT-VL10-3 994.9546666666668 30.203003353235516 1000.0 752.4
VL10-4 0.21101357204694063 0.2214363148760211
RT-VL10-4 989.4373333333332 35.59493886620525 1000.0 752.0
VL10-5 1.1781033497506128 0.357921014437971
RT-VL10-5 991.112 30.524477434784476 1000.0 752.0
VL10-all 0.3665907054544697 0.40720766576862927
RT-VL10-all 994.0906666666667 3.2926217854807964 997.6 989.4373333333332
*** budget = 20
VL20-1 0.08507422157079869 0.06945277072765164
RT-VL20-1 997.2266666666667 1.2067955732250417 999.6 994.0
VL20-2 0.08951715515184165 0.06592676844697425
RT-VL20-2 998.4800000000002 1.0949581422745516 1000.0 995.2
VL20-3 0.05117239126820821 0.061683390393260006
RT-VL20-3 990.7626666666666 42.28589291425163 1000.0 752.4
VL20-4 0.08715876910251656 0.07197340430924506
RT-VL20-4 992.2133333333334 28.10626185666738 1000.0 846.0
VL20-5 0.9844894362213421 0.20570163144864584
RT-VL20-5 994.624 14.863537847138998 1000.0 916.4
VL20-all 0.2594823946629415 0.3627754383177759
RT-VL20-all 994.6613333333332 2.9105617938047588 998.4800000000002 990.7626666666666
*** budget = 30
VL30-1 0.061868732661371 0.05606575977492569
RT-VL30-1 997.2560000000001 1.073342443025523 999.2 994.0
VL30-2 0.06781491309445244 0.049427145026087534
RT-VL30-2 998.3439999999999 1.197357089593577 1000.0 995.2
VL30-3 0.02941530978389774 0.02587539364451623
RT-VL30-3 999.1599999999999 1.528921188289313 1000.0 990.4
VL30-4 0.05622760753296705 0.04460871066543701
RT-VL30-4 991.0320000000002 31.021756494434666 1000.0 846.0
VL30-5 0.9354776033003777 0.19036405638131876
RT-VL30-5 992.4320000000001 20.07136706853821 1000.0 892.4
VL30-all 0.23016083327461317 0.3529027365226838
RT-VL30-all 995.6448 3.2814108185351207 999.1599999999999 991.0320000000002
*** budget = 40
VL40-1 0.05119908027110096 0.048047727796415474
RT-VL40-1 997.2864864864864 0.9935364524011152 998.8 995.2
VL40-2 0.04824136011129647 0.043241749520412126
RT-VL40-2 998.443243243243 1.095578470067744 1000.0 995.2
VL40-3 0.020013315685515936 0.018629902450072396
RT-VL40-3 999.3189189189186 0.9057885433796043 1000.0 995.2
VL40-4 0.048474960049929 0.04186831026724433
RT-VL40-4 988.010810810811 35.57034738171809 1000.0 846.0
VL40-5 0.8697027116649938 0.16149132808900607
RT-VL40-5 994.972972972973 12.779219801263276 1000.0 922.4
VL40-all 0.20752628555656724 0.3312841499062334
RT-VL40-all 995.6064864864863 4.068104516952092 999.3189189189186 988.010810810811
*** budget = 50
VL50-1 0.03394886226007274 0.026711677288334013
RT-VL50-1 997.1333333333332 1.0637460014286997 998.8 994.0
VL50-2 0.041336687590328255 0.03938448991591725
RT-VL50-2 998.4266666666665 1.0376040777783304 1000.0 995.2
VL50-3 0.01546289233869098 0.01578457973828372
RT-VL50-3 999.5599999999998 0.4659041389241716 1000.0 998.4
VL50-4 0.03755711107021984 0.03200338146529991
RT-VL50-4 988.9866666666666 36.1463297660434 1000.0 846.0
VL50-5 0.8543941071537223 0.13351836159579894
RT-VL50-5 991.4933333333333 21.993437405028097 1000.0 892.4
VL50-all 0.19653993208260684 0.3290472459829866
RT-VL50-all 995.1199999999999 4.1345288593609455 999.5599999999998 988.9866666666666
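The metric rows above can be checked mechanically. The sketch below assumes, judging only from the numbers in this log, that the first two columns of each `VL<budget>-<k>` row are a mean and a standard deviation, and that each `-all` row is the mean and population standard deviation of the five per-target means. The helper names `parse_rows` and `recompute_all` are hypothetical and are not part of the code that produced this log.

```python
import re
from statistics import fmean, pstdev

# Metric rows look like "<name> <float> <float> [<float> <float>]".
# Assumption: names follow the VL<budget>-<k> / RT-VL<budget>-<k> pattern seen in this log.
ROW = re.compile(r"^(?P<name>(?:RT-)?VL\d+-(?:\d+|all))\s+(?P<vals>.+)$")


def parse_rows(lines):
    """Map each metric name to its list of float columns."""
    rows = {}
    for line in lines:
        m = ROW.match(line.strip())
        if m:
            rows[m.group("name")] = [float(v) for v in m.group("vals").split()]
    return rows


def recompute_all(rows, prefix, budget, n_targets=5):
    """Recompute the '-all' row from the first column of the per-target rows."""
    means = [rows[f"{prefix}{budget}-{i}"][0] for i in range(1, n_targets + 1)]
    return fmean(means), pstdev(means)


# Rows copied from the budget = 10 block of the first run in this log.
sample = """\
VL10-1 0.16801206981905045 0.1615379569118344
VL10-2 0.17191424431709987 0.1590571645281902
VL10-3 0.10391029133864471 0.12059193384106831
VL10-4 0.21101357204694063 0.2214363148760211
VL10-5 1.1781033497506128 0.357921014437971
VL10-all 0.3665907054544697 0.40720766576862927
"""

rows = parse_rows(sample.splitlines())
mean, std = recompute_all(rows, "VL", 10)
logged_mean, logged_std = rows["VL10-all"]
print(f"recomputed mean={mean:.6f} std={std:.6f}")
print(f"logged     mean={logged_mean:.6f} std={logged_std:.6f}")
```

If the assumption about the columns holds, the recomputed pair agrees with the logged `VL10-all` row, which makes this a quick sanity check when comparing seeds.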
logs_data/30hc/cgrew1/ablo/run_2024-08-10_20-39-00/run_1a172_00003_3_clip_discriminator=10,dr_cc=0.8000,seed=2_2024-08-10_20-39-00/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type1-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s2/08.10_20.39.48/models/ckpt_policy_T15000000.pt
0_hccustom_v0_type1-infogsdr_ppo-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s2
v_data: 0
Inferred from ckptpath name:
il_method: infogsdr
rl_method: ppo
activation: relu
hidden_size: [100, 100]
norm_obs: 0
info_loss_type: None
encode_sampling: normal
normalize_code: 0
tl_emb: 0
TRAINED C_DATA: 1
ARG C_DATA: 1
Using (100, 100) relu networks.
State dim: 17, action dim: 6, action bound 1
args.encode_dim: 2
TRAJ is loaded from /home/vsreeramdass3/code/vild_code1/imitation_data/STRAT_h5/HCCustom_v0.h5 with traj_num 25.0, data_size 25000 steps, and average return 996.96
No of trajs: Train 15 Val 3 Test 7
Total data pairs: 14985, K 15, state dim 17, action dim 6, a min -0.7615942, a_max 0.7615942
Policy model is loaded from logs_data/30hc/cgrew1/ablo/run_2024-08-10_20-39-00/run_1a172_00003_3_clip_discriminator=10,dr_cc=0.8000,seed=2_2024-08-10_20-39-00/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type1-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s2/08.10_20.39.48/models/ckpt_policy_T15000000.pt
Discr model is loaded from logs_data/30hc/cgrew1/ablo/run_2024-08-10_20-39-00/run_1a172_00003_3_clip_discriminator=10,dr_cc=0.8000,seed=2_2024-08-10_20-39-00/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type1-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s2/08.10_20.39.48/models/ckpt_discr_T15000000.pt
***** Max Ep Steps: 1000 args.seed 1 test_seed 1 *****
*** budgets = [10, 20, 30, 40, 50] NPARALLEL = 50 n_test_episodes = 5 ***
***** num zs = 1500 *****
***** args.encode_sampling = normal *****
current_vel_mean [0.97, 1.78, 2.88, 3.65, 4.87]
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
zvs shape 1500
RT 966.0994666666668 134.4202922170442 1000.0 -306.4
LR 0.9685333333333335 0.14704823093877134 1.0 -1.0
VL-std-all 0.3428453207242678 0.0
*** budget = 10
VL10-1 0.11280558515024867 0.10376336287598616
RT-VL10-1 991.9626666666667 67.3521348799642 1000.0 170.0
VL10-2 0.2065152615986725 0.16123020964989992
RT-VL10-2 976.0186666666666 111.75346520304812 1000.0 334.8
VL10-3 0.23968181425773183 0.2517525739175653
RT-VL10-3 959.664 83.60289730226658 1000.0 632.4
VL10-4 0.2896309580518288 0.2525700635820907
RT-VL10-4 970.5413333333335 51.73784454557633 1000.0 754.8
VL10-5 0.48878941810649756 0.472246030073284
RT-VL10-5 986.5946666666667 29.60788135315025 1000.0 838.0
VL10-all 0.26748460743299585 0.12480555792682081
RT-VL10-all 976.9562666666667 11.48167013480376 991.9626666666667 959.664
*** budget = 20
VL20-1 0.06417209954365856 0.06513319492164056
RT-VL20-1 997.2266666666665 1.3581687016796633 999.6 994.0
VL20-2 0.1147395701222456 0.09625284866390663
RT-VL20-2 970.7626666666666 118.30612807270617 1000.0 432.8
VL20-3 0.10338975969199152 0.09234645023228176
RT-VL20-3 954.1866666666666 83.44774226358008 1000.0 718.0
VL20-4 0.15238591465168233 0.1494076042027167
RT-VL20-4 973.1253333333334 51.56141992700443 999.6 754.8
VL20-5 0.24240179053748204 0.2549467332126076
RT-VL20-5 991.7653333333335 20.90385287187242 1000.0 865.2
VL20-all 0.135417826909412 0.060436455118606125
RT-VL20-all 977.4133333333333 15.496092651590159 997.2266666666665 954.1866666666666
*** budget = 30
VL30-1 0.037915341232513916 0.031562570113196756
RT-VL30-1 996.9839999999999 1.497111886266356 1000.0 994.0
VL30-2 0.07045969111257534 0.05601742179300732
RT-VL30-2 987.552 77.48947345285036 1000.0 445.2
VL30-3 0.07101563755093356 0.06528658597047479
RT-VL30-3 958.3679999999999 76.33726597147687 1000.0 718.0
VL30-4 0.10685276974258986 0.1278872563857188
RT-VL30-4 971.544 56.11895993334161 1000.0 754.8
VL30-5 0.16383373934743392 0.17839406080628897
RT-VL30-5 995.2879999999999 11.176343588132928 1000.0 926.0
VL30-all 0.09001543579720932 0.04287301503426448
RT-VL30-all 981.9472 14.83038316969591 996.9839999999999 958.3679999999999
*** budget = 40
VL40-1 0.029869546446021442 0.02058216446068455
RT-VL40-1 974.8648648648649 133.81916037862885 1000.0 172.0
VL40-2 0.054474984157935835 0.04136405712361246
RT-VL40-2 998.8216216216215 1.047812427802244 1000.0 996.0
VL40-3 0.05810390820942648 0.052352096914526906
RT-VL40-3 955.3513513513514 82.63353687875623 1000.0 718.0
VL40-4 0.08293639181692272 0.10083707450732818
RT-VL40-4 969.9135135135136 62.51777763234756 999.6 754.8
VL40-5 0.15082530905312774 0.14685256606738112
RT-VL40-5 993.5135135135133 16.23772987332301 1000.0 926.0
VL40-all 0.07524202793668684 0.041365886794709615
RT-VL40-all 978.4929729729729 15.881911335536074 998.8216216216215 955.3513513513514
*** budget = 50
VL50-1 0.024866040181841652 0.018269522559511118
RT-VL50-1 969.6000000000001 148.48946090547975 1000.0 170.0
VL50-2 0.04504027468834274 0.04103315729795057
RT-VL50-2 998.88 1.104656809451093 1000.0 996.0
VL50-3 0.04289270710978707 0.03768278017639061
RT-VL50-3 960.5733333333334 76.166882275406 1000.0 718.0
VL50-4 0.08091316760601582 0.10972447101871188
RT-VL50-4 962.2 68.52018680651712 999.2 754.8
VL50-5 0.12542783362923143 0.18733725893786937
RT-VL50-5 997.6533333333335 2.40384876581055 1000.0 986.4
VL50-all 0.06382800464304375 0.03576518384946905
RT-VL50-all 977.7813333333334 17.005161255728623 998.88 960.5733333333334
logs_data/30hc/cgrew1/ablo/run_2024-08-10_20-39-00/run_1a172_00005_5_clip_discriminator=10,dr_cc=0.8000,seed=3_2024-08-10_20-39-00/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type1-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s3/08.10_20.39.50/models/ckpt_policy_T15000000.pt
0_hccustom_v0_type1-infogsdr_ppo-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s3
v_data: 0
Inferred from ckptpath name:
il_method: infogsdr
rl_method: ppo
activation: relu
hidden_size: [100, 100]
norm_obs: 0
info_loss_type: None
encode_sampling: normal
normalize_code: 0
tl_emb: 0
TRAINED C_DATA: 1
ARG C_DATA: 1
Using (100, 100) relu networks.
State dim: 17, action dim: 6, action bound 1
args.encode_dim: 2
TRAJ is loaded from /home/vsreeramdass3/code/vild_code1/imitation_data/STRAT_h5/HCCustom_v0.h5 with traj_num 25.0, data_size 25000 steps, and average return 996.96
No of trajs: Train 15 Val 3 Test 7
Total data pairs: 14985, K 15, state dim 17, action dim 6, a min -0.7615942, a_max 0.7615942
Policy model is loaded from logs_data/30hc/cgrew1/ablo/run_2024-08-10_20-39-00/run_1a172_00005_5_clip_discriminator=10,dr_cc=0.8000,seed=3_2024-08-10_20-39-00/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type1-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s3/08.10_20.39.50/models/ckpt_policy_T15000000.pt
Discr model is loaded from logs_data/30hc/cgrew1/ablo/run_2024-08-10_20-39-00/run_1a172_00005_5_clip_discriminator=10,dr_cc=0.8000,seed=3_2024-08-10_20-39-00/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type1-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s3/08.10_20.39.50/models/ckpt_discr_T15000000.pt
***** Max Ep Steps: 1000 args.seed 1 test_seed 1 *****
*** budgets = [10, 20, 30, 40, 50] NPARALLEL = 50 n_test_episodes = 5 ***
***** num zs = 1500 *****
***** args.encode_sampling = normal *****
current_vel_mean [0.97, 1.78, 2.88, 3.65, 4.87]
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
zvs shape 1500
RT 948.0776 186.34060986512486 1000.0 -172.0
LR 0.9447999999999999 0.22825342640728677 1.0 -1.0
VL-std-all 0.4858099457045453 0.0
*** budget = 10
VL10-1 0.2145791573318249 0.23252642360723583
RT-VL10-1 968.7333333333333 150.72493563515704 999.6 83.6
VL10-2 0.15375827586561347 0.14766574644384048
RT-VL10-2 970.1653333333333 129.16275365427742 1000.0 298.4
VL10-3 0.12791794644797996 0.11519105461183511
RT-VL10-3 963.6533333333333 88.12528858896798 1000.0 538.4
VL10-4 0.2254770273792506 0.20230365025180846
RT-VL10-4 979.9413333333334 54.47211664288029 1000.0 764.0
VL10-5 1.2133957160229978 0.39608326458305726
RT-VL10-5 986.4773333333336 45.04210052334988 1000.0 764.0
VL10-all 0.3870256246095333 0.4147948182411685
RT-VL10-all 973.7941333333332 8.249452597597116 986.4773333333336 963.6533333333333
*** budget = 20
VL20-1 0.09743221393466747 0.08839159079326518
RT-VL20-1 985.4826666666667 94.1994881420394 998.8 175.2
VL20-2 0.07674490011291772 0.07872637896305733
RT-VL20-2 981.664 100.29307538741979 1000.0 359.6
VL20-3 0.06984977677099627 0.05727531039499511
RT-VL20-3 959.4026666666666 87.47334218428429 1000.0 670.0
VL20-4 0.1175739909392959 0.11935684039849793
RT-VL20-4 980.2346666666667 51.62697355280689 1000.0 776.0
VL20-5 0.9632684023818456 0.33534207596619653
RT-VL20-5 992.0853333333334 28.523958085947488 1000.0 776.0
VL20-all 0.2649738568279446 0.34954706169035293
RT-VL20-all 979.7738666666668 10.981196100810026 992.0853333333334 959.4026666666666
*** budget = 30
VL30-1 0.062481631599166934 0.04482310343659671
RT-VL30-1 980.12 114.99346068364062 998.8 175.2
VL30-2 0.05054928182656413 0.04434988019025417
RT-VL30-2 973.64 122.04191411150515 1000.0 359.6
VL30-3 0.048778415264521986 0.04441375636430553
RT-VL30-3 955.192 88.188252823151 1000.0 775.2
VL30-4 0.07593919779293168 0.08197533867150594
RT-VL30-4 972.5039999999999 55.58893760452704 1000.0 806.8
VL30-5 0.8291366277945081 0.2884394166121171
RT-VL30-5 994.5199999999999 11.12898917242712 1000.0 938.8
VL30-all 0.21337703085553858 0.30803377762980766
RT-VL30-all 975.1951999999999 12.708192356114186 994.5199999999999 955.192
*** budget = 40
VL40-1 0.06096766885099665 0.04798875413638915
RT-VL40-1 974.5729729729729 133.23181311644967 998.8 175.2
VL40-2 0.04349701256904264 0.039230073364321036
RT-VL40-2 964.8648648648649 140.82212705475294 1000.0 359.6
VL40-3 0.03290429475115024 0.028884346968788055
RT-VL40-3 942.4324324324323 101.28742632686998 1000.0 670.0
VL40-4 0.061120295214230846 0.05224821016701737
RT-VL40-4 979.0918918918917 53.02447358648125 1000.0 806.8
VL40-5 0.7693743983196811 0.2777625836041224
RT-VL40-5 990.897297297297 18.322375723547914 1000.0 910.8
VL40-all 0.19357273394102031 0.2881015022558778
RT-VL40-all 970.3718918918918 16.284474144836476 990.897297297297 942.4324324324323
*** budget = 50
VL50-1 0.04983807793870754 0.04471709219242653
RT-VL50-1 969.0799999999999 147.42208427957232 998.4 175.2
VL50-2 0.028874108501098898 0.02068020435243387
RT-VL50-2 957.0133333333332 155.34415069630256 1000.0 359.6
VL50-3 0.029484828488555898 0.02667745596564479
RT-VL50-3 962.3733333333333 82.0017517428066 1000.0 775.2
VL50-4 0.04556495073801193 0.039475089687377285
RT-VL50-4 966.9733333333331 65.1698648831566 1000.0 806.8
VL50-5 0.7196212384068588 0.26351427608079125
RT-VL50-5 992.9733333333334 13.850124748736226 1000.0 938.8
VL50-all 0.1746766408146466 0.27260159593659417
RT-VL50-all 969.6826666666666 12.362016088720273 992.9733333333334 957.0133333333332
logs_data/30hc/cgrew1/ablo/run_2024-08-10_20-39-00/run_1a172_00007_7_clip_discriminator=10,dr_cc=0.8000,seed=4_2024-08-10_20-39-00/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type1-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s4/08.10_20.39.49/models/ckpt_policy_T15000000.pt
0_hccustom_v0_type1-infogsdr_ppo-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s4
v_data: 0
Inferred from ckptpath name:
il_method: infogsdr
rl_method: ppo
activation: relu
hidden_size: [100, 100]
norm_obs: 0
info_loss_type: None
encode_sampling: normal
normalize_code: 0
tl_emb: 0
TRAINED C_DATA: 1
ARG C_DATA: 1
Using (100, 100) relu networks.
State dim: 17, action dim: 6, action bound 1
args.encode_dim: 2
TRAJ is loaded from /home/vsreeramdass3/code/vild_code1/imitation_data/STRAT_h5/HCCustom_v0.h5 with traj_num 25.0, data_size 25000 steps, and average return 996.96
No of trajs: Train 15 Val 3 Test 7
Total data pairs: 14985, K 15, state dim 17, action dim 6, a min -0.7615942, a_max 0.7615942
Policy model is loaded from logs_data/30hc/cgrew1/ablo/run_2024-08-10_20-39-00/run_1a172_00007_7_clip_discriminator=10,dr_cc=0.8000,seed=4_2024-08-10_20-39-00/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type1-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s4/08.10_20.39.49/models/ckpt_policy_T15000000.pt
Discr model is loaded from logs_data/30hc/cgrew1/ablo/run_2024-08-10_20-39-00/run_1a172_00007_7_clip_discriminator=10,dr_cc=0.8000,seed=4_2024-08-10_20-39-00/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type1-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s4/08.10_20.39.49/models/ckpt_discr_T15000000.pt
***** Max Ep Steps: 1000 args.seed 1 test_seed 1 *****
*** budgets = [10, 20, 30, 40, 50] NPARALLEL = 50 n_test_episodes = 5 ***
***** num zs = 1500 *****
***** args.encode_sampling = normal *****
current_vel_mean [0.97, 1.78, 2.88, 3.65, 4.87]
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
zvs shape 1500
RT 962.8856000000001 153.9301563891018 1000.0 -78.0
LR 0.9565333333333335 0.20429712566640668 1.0 -0.6
VL-std-all 0.2619756882997013 0.0
*** budget = 10
VL10-1 0.15361818466635638 0.15330476465497578
RT-VL10-1 966.232 156.11071127888695 1000.0 78.4
VL10-2 0.16128052515106647 0.1455866638719984
RT-VL10-2 976.0586666666668 112.84541295457642 1000.0 381.2
VL10-3 0.3012678928157036 0.19677311766702288
RT-VL10-3 956.7466666666667 97.62714012450067 1000.0 606.8
VL10-4 0.14758918291872652 0.20539026673974697
RT-VL10-4 989.36 26.951294341212385 1000.0 861.2
VL10-5 0.9825230565075204 0.31697366500651225
RT-VL10-5 996.2080000000001 11.369922427176014 1000.0 910.8
VL10-all 0.3492557684118747 0.32174787095906543
RT-VL10-all 976.9210666666668 14.482535380243363 996.2080000000001 956.7466666666667
*** budget = 20
VL20-1 0.06587802204668669 0.06780309654101134
RT-VL20-1 967.1413333333333 151.07936906878524 1000.0 204.0
VL20-2 0.0839002191119055 0.07866285472700205
RT-VL20-2 984.4853333333334 87.89541162591416 1000.0 442.4
VL20-3 0.2020517191379541 0.14660188874430358
RT-VL20-3 943.8986666666668 111.97161722905 1000.0 634.4
VL20-4 0.06881999852011207 0.06418221463075031
RT-VL20-4 990.7626666666669 24.05045126857753 1000.0 888.4
VL20-5 0.8199739550841811 0.10097706994255853
RT-VL20-5 998.1386666666666 3.817429967341325 1000.0 980.4
VL20-all 0.2481247827801679 0.290333600475931
RT-VL20-all 976.8853333333334 19.41705576033604 998.1386666666666 943.8986666666668
*** budget = 30
VL30-1 0.051557117743878764 0.05851158868751511
RT-VL30-1 981.96 111.71364106500154 999.6 200.0
VL30-2 0.0529426752946929 0.04769829719640866
RT-VL30-2 977.4079999999999 106.94732131287816 1000.0 442.4
VL30-3 0.15576994015293347 0.11326933161279805
RT-VL30-3 943.2559999999999 106.64906218059305 1000.0 680.0
VL30-4 0.046845593846675276 0.04019459205278116
RT-VL30-4 991.784 21.375755986631216 1000.0 906.8
VL30-5 0.7878288789380459 0.10410910977246518
RT-VL30-5 998.592 2.8116785022473683 1000.0 981.6
VL30-all 0.21898884119524525 0.2873373024249055
RT-VL30-all 978.6 19.159367004157573 998.592 943.2559999999999
*** budget = 40
VL40-1 0.03614761956283649 0.039080354825007525
RT-VL40-1 977.2216216216217 123.342071328207 999.6 237.2
VL40-2 0.03703350134868353 0.03624682227163011
RT-VL40-2 969.7405405405403 123.41074345334694 1000.0 442.4
VL40-3 0.12885151845129844 0.10251930754191889
RT-VL40-3 922.3783783783782 120.1785678944217 1000.0 680.0
VL40-4 0.03414685034617606 0.028663373329982207
RT-VL40-4 986.3783783783782 29.887131237148875 1000.0 888.4
VL40-5 0.7645489360564631 0.10168720936137843
RT-VL40-5 998.7567567567565 3.026061622114248 1000.0 981.6
VL40-all 0.20014568515309153 0.2844962041982422
RT-VL40-all 970.895135135135 26.1198544668676 998.7567567567565 922.3783783783782
*** budget = 50
VL50-1 0.026807532116723366 0.02788030365911426
RT-VL50-1 971.1600000000002 142.4622092579877 999.6 204.0
VL50-2 0.03346613103657808 0.030052688619460043
RT-VL50-2 962.8666666666666 136.13938282347087 1000.0 442.4
VL50-3 0.11555711249445282 0.09705347067713302
RT-VL50-3 936.8533333333331 113.0977504442752 1000.0 712.0
VL50-4 0.03356406338018905 0.03405640340714334
RT-VL50-4 987.2933333333333 28.471962036751567 1000.0 888.4
VL50-5 0.7412973857097003 0.10011368447281736
RT-VL50-5 998.6800000000001 3.303573822392953 1000.0 981.6
VL50-all 0.1901384449475287 0.2775165909015551
RT-VL50-all 971.3706666666667 21.275054396065812 998.6800000000001 936.8533333333331
logs_data/30hc/cgrew1/ablo/run_2024-08-10_20-39-00/run_1a172_00009_9_clip_discriminator=10,dr_cc=0.8000,seed=5_2024-08-10_20-39-00/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type1-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s5/08.10_20.39.54/models/ckpt_policy_T15000000.pt
0_hccustom_v0_type1-infogsdr_ppo-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s5
v_data: 0
Inferred from ckptpath name:
il_method: infogsdr
rl_method: ppo
activation: relu
hidden_size: [100, 100]
norm_obs: 0
info_loss_type: None
encode_sampling: normal
normalize_code: 0
tl_emb: 0
TRAINED C_DATA: 1
ARG C_DATA: 1
Using (100, 100) relu networks.
State dim: 17, action dim: 6, action bound 1
args.encode_dim: 2
TRAJ is loaded from /home/vsreeramdass3/code/vild_code1/imitation_data/STRAT_h5/HCCustom_v0.h5 with traj_num 25.0, data_size 25000 steps, and average return 996.96
No of trajs: Train 15 Val 3 Test 7
Total data pairs: 14985, K 15, state dim 17, action dim 6, a min -0.7615942, a_max 0.7615942
Policy model is loaded from logs_data/30hc/cgrew1/ablo/run_2024-08-10_20-39-00/run_1a172_00009_9_clip_discriminator=10,dr_cc=0.8000,seed=5_2024-08-10_20-39-00/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type1-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s5/08.10_20.39.54/models/ckpt_policy_T15000000.pt
Discr model is loaded from logs_data/30hc/cgrew1/ablo/run_2024-08-10_20-39-00/run_1a172_00009_9_clip_discriminator=10,dr_cc=0.8000,seed=5_2024-08-10_20-39-00/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type1-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s5/08.10_20.39.54/models/ckpt_discr_T15000000.pt
***** Max Ep Steps: 1000 args.seed 1 test_seed 1 *****
*** budgets = [10, 20, 30, 40, 50] NPARALLEL = 50 n_test_episodes = 5 ***
***** num zs = 1500 *****
***** args.encode_sampling = normal *****
current_vel_mean [0.97, 1.78, 2.88, 3.65, 4.87]
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
zvs shape 1500
RT 978.5506666666666 112.07245284289186 1000.0 -135.6
LR 0.9757333333333332 0.1473243436171437 1.0 -1.0
VL-std-all 0.17689553347058182 0.0
*** budget = 10
VL10-1 0.08646988342561182 0.07476326676970103
RT-VL10-1 997.2293333333334 1.4199317667487492 999.6 992.4
VL10-2 0.15784264131377246 0.12926345030839645
RT-VL10-2 992.7146666666667 41.982309586565414 1000.0 577.2
VL10-3 0.33050431897947186 0.31570504261195015
RT-VL10-3 972.04 69.05692724122613 1000.0 640.4
VL10-4 0.5663816464369016 0.5352145680398095
RT-VL10-4 981.6239999999998 50.99680993944621 1000.0 640.4
VL10-5 1.6977302919347916 0.6171926469733827
RT-VL10-5 982.2053333333332 50.76285753273636 1000.0 640.4
VL10-all 0.5677857564181099 0.5886930325114735
RT-VL10-all 985.1626666666667 8.900668639053114 997.2293333333334 972.04
*** budget = 20
VL20-1 0.04489422741729457 0.04788873591461115
RT-VL20-1 997.3066666666666 1.3359973386533712 999.6 993.2
VL20-2 0.07895439649576952 0.06399997418738158
RT-VL20-2 992.0746666666668 48.265032458522406 1000.0 577.2
VL20-3 0.16418171830220007 0.17988433486841762
RT-VL20-3 964.5919999999999 75.64710659370918 999.2 695.2
VL20-4 0.26830815340136777 0.2798197612508025
RT-VL20-4 989.9040000000002 25.411710896094082 1000.0 845.6
VL20-5 1.3333561510514038 0.3793314161065256
RT-VL20-5 990.3520000000001 24.191499112980434 1000.0 845.6
VL20-all 0.3779389293336072 0.4839098354456477
RT-VL20-all 986.8458666666668 11.433708218538209 997.3066666666666 964.5919999999999
*** budget = 30
VL30-1 0.02887933432341529 0.03240043765869242
RT-VL30-1 997.368 1.2752160601247104 999.6 993.2
VL30-2 0.049301720362431196 0.03609641596148447
RT-VL30-2 997.7279999999998 1.8109710102594108 1000.0 991.6
VL30-3 0.09734011240749069 0.0863036896139303
RT-VL30-3 965.6239999999999 75.61013307751811 998.8 695.2
VL30-4 0.15853655285858195 0.13448786104410146
RT-VL30-4 988.568 29.753342266037947 1000.0 845.6
VL30-5 1.173441913108643 0.22938847913252008
RT-VL30-5 988.1519999999999 29.99582797657034 1000.0 845.6
VL30-all 0.30149992661211245 0.43825301056854593
RT-VL30-all 987.4879999999999 11.68001534245569 997.7279999999998 965.6239999999999
*** budget = 40
VL40-1 0.025211131181165972 0.027925533518416985
RT-VL40-1 997.2432432432432 1.2845317877754892 999.2 993.2
VL40-2 0.04111544991643272 0.031258017187546824
RT-VL40-2 997.4918918918918 2.049440052515121 1000.0 991.6
VL40-3 0.07423256088286218 0.06561704213402379
RT-VL40-3 965.7945945945946 76.36819044158459 998.8 695.2
VL40-4 0.12949515676674414 0.12081756144783931
RT-VL40-4 985.5459459459461 33.4025293960073 1000.0 845.6
VL40-5 1.1249827051398824 0.19908522918383662
RT-VL40-5 986.2810810810811 33.177636866124935 1000.0 845.6
VL40-all 0.2790074007774175 0.42449169332262116
RT-VL40-all 986.4713513513514 11.54042301351634 997.4918918918918 965.7945945945946
*** budget = 50
VL50-1 0.015643034480733578 0.014805983925466266
RT-VL50-1 997.36 1.2026637102698265 999.2 993.2
VL50-2 0.03680039472843332 0.027545453805908286
RT-VL50-2 997.2933333333334 2.15080346744084 1000.0 991.6
VL50-3 0.05750160150041843 0.06286562190652362
RT-VL50-3 962.8933333333334 82.48582477546604 998.8 695.2
VL50-4 0.10473184703408557 0.07819596053754985
RT-VL50-4 991.8266666666666 19.95880646620823 1000.0 885.6
VL50-5 1.0663962943203606 0.14504776383105608
RT-VL50-5 992.52 20.002973112348403 1000.0 885.6
VL50-all 0.25621463441280634 0.4061638904782096
RT-VL50-all 988.3786666666667 12.951271683592365 997.36 962.8933333333334
logs_data/30hc/cgrew/ig/run_2024-05-29_17-35-27/run_5db2e_00009_9_dr_cc=0.8000,seed=5_2024-05-29_17-35-27/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type1-INFOGSDR_PPO-100-100-relu_cr0_no0_es=n_sr0000_rc0.80_reg00_ds0.001_s5/05.29_17.36.03/models/ckpt_policy_T15000000.pt
0_hccustom_v0_type1-infogsdr_ppo-100-100-relu_cr0_no0_es=n_sr0000_rc0.80_reg00_ds0.001_s5
v_data: 0
Inferred from ckptpath name:
il_method: infogsdr
rl_method: ppo
activation: relu
hidden_size: [100, 100]
norm_obs: 0
info_loss_type: None
encode_sampling: normal
normalize_code: 0
tl_emb: 0
TRAINED C_DATA: 1
ARG C_DATA: 1
Using (100, 100) relu networks.
State dim: 17, action dim: 6, action bound 1
args.encode_dim: 2
TRAJ is loaded from /home/vsreeramdass3/code/vild_code1/imitation_data/STRAT_h5/HCCustom_v0.h5 with traj_num 25.0, data_size 25000 steps, and average return 996.96
No of trajs: Train 15 Val 3 Test 7
Total data pairs: 14985, K 15, state dim 17, action dim 6, a min -0.7615942, a_max 0.7615942
Policy model is loaded from logs_data/30hc/cgrew/ig/run_2024-05-29_17-35-27/run_5db2e_00009_9_dr_cc=0.8000,seed=5_2024-05-29_17-35-27/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type1-INFOGSDR_PPO-100-100-relu_cr0_no0_es=n_sr0000_rc0.80_reg00_ds0.001_s5/05.29_17.36.03/models/ckpt_policy_T15000000.pt
Error(s) in loading state_dict for PairDiscriminator:
Missing key(s) in state_dict: "reward_offset".
***** Max Ep Steps: 1000 args.seed 1 test_seed 1 *****
*** budgets = [10, 20, 30, 40, 50] NPARALLEL = 50 n_test_episodes = 5 ***
***** num zs = 1500 *****
***** args.encode_sampling = normal *****
current_vel_mean [0.97, 1.78, 2.88, 3.65, 4.87]
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
zvs shape 1500
RT 980.8168000000002 72.72508992381286 1000.0 228.4
LR 0.9677333333333334 0.14578132787005185 1.0 -1.0
VL-std-all 0.3414440241975213 0.0
*** budget = 10
VL10-1 0.15367732728713285 0.22755907228109767
RT-VL10-1 987.68 78.23173780506221 1000.0 316.4
VL10-2 0.287133356934324 0.22621422265134575
RT-VL10-2 959.4133333333333 133.25837092739135 1000.0 318.8
VL10-3 0.1498754722240085 0.1541521221743166
RT-VL10-3 974.1279999999999 68.51932391182311 1000.0 622.4
VL10-4 0.13726981986091463 0.17446359044058316
RT-VL10-4 991.738666666667 26.479677205149027 1000.0 796.4
VL10-5 0.7153623680968946 0.3848291602027269
RT-VL10-5 994.0266666666666 18.736441023370002 1000.0 832.4
VL10-all 0.28866366888065487 0.22021702715117328
RT-VL10-all 981.3973333333333 12.977323291889732 994.0266666666666 959.4133333333333
*** budget = 20
VL20-1 0.06147138883215753 0.049118940022679546
RT-VL20-1 988.2133333333335 78.10571525538676 999.2 316.4
VL20-2 0.16356105067158155 0.14776226777161292
RT-VL20-2 974.2186666666668 104.69538378022638 1000.0 442.4
VL20-3 0.06903402001043982 0.06348008746472343
RT-VL20-3 981.1146666666667 54.275751966744366 1000.0 747.2
VL20-4 0.057398821883375736 0.0762303445330342
RT-VL20-4 994.8426666666667 11.862604248458924 1000.0 916.8
VL20-5 0.4843280574221539 0.17128230921545165
RT-VL20-5 997.8133333333335 6.311161189159693 1000.0 948.0
VL20-all 0.1671586677639417 0.16337370399019865
RT-VL20-all 987.2405333333334 8.690432955842887 997.8133333333335 974.2186666666668
*** budget = 30
VL30-1 0.048712189814879105 0.04609758259858706
RT-VL30-1 997.3840000000001 1.154358696419793 999.2 994.0
VL30-2 0.13868829511151443 0.1362505213579863
RT-VL30-2 983.12 78.00943532676031 1000.0 486.4
VL30-3 0.04598434731397307 0.043458723647538854
RT-VL30-3 976.6160000000001 57.38533387547727 1000.0 793.6
VL30-4 0.030668599334237285 0.03362094666785723
RT-VL30-4 995.52 8.703102894945 1000.0 957.2
VL30-5 0.43020515264972586 0.13332807652363876
RT-VL30-5 996.3439999999999 9.597336297119107 1000.0 937.2
VL30-all 0.13885171684486594 0.15055890596909582
RT-VL30-all 989.7968000000001 8.38450471763238 997.3840000000001 976.6160000000001
*** budget = 40
VL40-1 0.04110912901549783 0.04483745318333667
RT-VL40-1 978.8864864864865 110.42029328314749 999.2 316.4
VL40-2 0.08519144538525422 0.07372411530788607
RT-VL40-2 955.8810810810812 141.9320237712555 1000.0 442.4
VL40-3 0.03458398723099483 0.03296585906247236
RT-VL40-3 984.2486486486486 50.098743694782094 1000.0 801.2
VL40-4 0.025269367612474124 0.02873033058717423
RT-VL40-4 995.8378378378377 7.988347978973053 1000.0 957.2
VL40-5 0.3991593249379236 0.10614861705318035
RT-VL40-5 997.8702702702702 3.354453664371211 1000.0 983.2
VL40-all 0.11706265083642893 0.14254261225907858
RT-VL40-all 982.544864864865 15.091820026158858 997.8702702702702 955.8810810810812
*** budget = 50
VL50-1 0.030365551619425928 0.02191714581695009
RT-VL50-1 997.3999999999999 1.036661307596009 999.2 995.6
VL50-2 0.0634740504922024 0.06872433247737107
RT-VL50-2 980.8133333333332 91.82310433049456 1000.0 486.4
VL50-3 0.02587799910281765 0.022566850425966352
RT-VL50-3 986.3333333333334 47.40116266740956 1000.0 801.2
VL50-4 0.019447311213732364 0.019433500339719197
RT-VL50-4 988.6666666666666 36.76757025181596 1000.0 796.4
VL50-5 0.37622115880209334 0.09985004702464956
RT-VL50-5 997.6933333333334 3.6216877587973006 1000.0 983.2
VL50-all 0.10307721424605434 0.13741665867630667
RT-VL50-all 990.1813333333333 6.532993732670458 997.6933333333334 980.8133333333332
logs_data/30hc/cgrew/ig/run_2024-05-29_17-35-27/run_5db2e_00007_7_dr_cc=0.8000,seed=4_2024-05-29_17-35-27/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type1-INFOGSDR_PPO-100-100-relu_cr0_no0_es=n_sr0000_rc0.80_reg00_ds0.001_s4/05.29_17.36.04/models/ckpt_policy_T15000000.pt
0_hccustom_v0_type1-infogsdr_ppo-100-100-relu_cr0_no0_es=n_sr0000_rc0.80_reg00_ds0.001_s4
v_data: 0
Inferred from ckptpath name:
il_method: infogsdr
rl_method: ppo
activation: relu
hidden_size: [100, 100]
norm_obs: 0
info_loss_type: None
encode_sampling: normal
normalize_code: 0
tl_emb: 0
TRAINED C_DATA: 1
ARG C_DATA: 1
Using (100, 100) relu networks.
State dim: 17, action dim: 6, action bound 1
args.encode_dim: 2
TRAJ is loaded from /home/vsreeramdass3/code/vild_code1/imitation_data/STRAT_h5/HCCustom_v0.h5 with traj_num 25.0, data_size 25000 steps, and average return 996.96
No of trajs: Train 15 Val 3 Test 7
Total data pairs: 14985, K 15, state dim 17, action dim 6, a min -0.7615942, a_max 0.7615942
Policy model is loaded from logs_data/30hc/cgrew/ig/run_2024-05-29_17-35-27/run_5db2e_00007_7_dr_cc=0.8000,seed=4_2024-05-29_17-35-27/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type1-INFOGSDR_PPO-100-100-relu_cr0_no0_es=n_sr0000_rc0.80_reg00_ds0.001_s4/05.29_17.36.04/models/ckpt_policy_T15000000.pt
Error(s) in loading state_dict for PairDiscriminator:
Missing key(s) in state_dict: "reward_offset".
***** Max Ep Steps: 1000 args.seed 1 test_seed 1 *****
*** budgets = [10, 20, 30, 40, 50] NPARALLEL = 50 n_test_episodes = 5 ***
***** num zs = 1500 *****
***** args.encode_sampling = normal *****
current_vel_mean [0.97, 1.78, 2.88, 3.65, 4.87]
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
zvs shape 1500
RT 972.9648 115.9320717818269 1000.0 -328.4
LR 0.9816 0.1194770828792423 1.0 -0.6
VL-std-all 0.291341067868871 0.0
*** budget = 10
VL10-1 0.08538698259098307 0.07595966285046778
RT-VL10-1 982.2160000000001 106.39672806999282 1000.0 202.0
VL10-2 0.29109541746342094 0.17889799779516616
RT-VL10-2 973.8213333333334 90.80233226569068 1000.0 518.4
VL10-3 0.157312527072331 0.18993173585092252
RT-VL10-3 972.5466666666669 56.90906742827153 1000.0 669.6
VL10-4 0.32921408269790575 0.2254172519032038
RT-VL10-4 979.8826666666669 43.08747187859702 1000.0 771.2
VL10-5 0.47248192099967506 0.5428979074220518
RT-VL10-5 987.1973333333332 28.38021833758311 1000.0 771.2
VL10-all 0.2670981861648632 0.13541682516840087
RT-VL10-all 979.1328000000001 5.416529342259171 987.1973333333332 972.5466666666669
*** budget = 20
VL20-1 0.047211700142732176 0.0395606234783512
RT-VL20-1 986.8053333333332 91.23644870091971 999.6 202.0
VL20-2 0.1922656022879277 0.135858518468895
RT-VL20-2 958.6666666666665 117.89500111916912 1000.0 518.4
VL20-3 0.05782860009790189 0.05143286414777555
RT-VL20-3 976.4000000000002 47.415373034491665 1000.0 797.6
VL20-4 0.19410856218232772 0.15117381337269264
RT-VL20-4 981.9306666666668 46.49449565510118 1000.0 771.2
VL20-5 0.16330413681938413 0.2603783599273052
RT-VL20-5 992.1493333333333 13.37759692753357 1000.0 936.8
VL20-all 0.13094372030605472 0.0650449675526911
RT-VL20-all 979.1903999999998 11.51034479336839 992.1493333333333 958.6666666666665
*** budget = 30
VL30-1 0.03227854892347282 0.02265652253710592
RT-VL30-1 981.328 111.33584335693516 999.2 202.0
VL30-2 0.14174930677853792 0.10486946166453731
RT-VL30-2 952.0 128.46923989811725 1000.0 518.4
VL30-3 0.04124515676997539 0.04043819741756769
RT-VL30-3 980.4479999999999 39.026115563811885 1000.0 816.4
VL30-4 0.15188327677070457 0.12743220708438074