Files
replaysim/docs/assets/frontier_vllm_alignment/frontier_vllm_alignment.csv

5.2 KiB

1run_idlabeltprequest_countscale_labelscale_valuefixturekv_blocksfrontier_completedfrontier_totalfrontier_completevllm_completedvllm_totalfrontier_preemptionsvllm_preemptionsfrontier_prefix_hitvllm_prefix_hitprefix_hit_deltafrontier_rpsvllm_rpsrps_ratiofrontier_total_tpsvllm_total_tpstotal_tps_ratiofrontier_decode_tpsvllm_decode_tpsdecode_tps_ratiofrontier_ttft_p50_svllm_ttft_p50_sttft_p50_ratiofrontier_ttft_p95_svllm_ttft_p95_sttft_p95_ratiofrontier_tpot_p50_svllm_tpot_p50_stpot_p50_ratiofrontier_tpot_p95_svllm_tpot_p95_stpot_p95_ratiofrontier_e2e_p50_svllm_e2e_p50_se2e_p50_ratiofrontier_e2e_p95_svllm_e2e_p95_se2e_p95_rationotes
2tp1_n100_scale1TP1 N100 raw1100raw1coder_1001528196100false100100080.24878456160.2510820686-0.0022975070750.40481487950.68798806910.5884039242348.9088213832.3205810.6129207541347.7992338567.44567950.61292075410.90874811364.5030254950.20180834312.7629581529.060469060.43918624020.056889664280.066081343960.86090356030.14568807930.62114914710.234546050530.9392831641.840767330.7394530534119.636137697.366229691.228723121Frontier incomplete before lifecycle fix; included as TP1 100-request baseline.
3tp1_n500_scale1TP1 N500 raw1500raw1coder_50015281439500false5005000630.11923746920.3868498695-0.26761240020.6609904720.84017194510.78673237764733.7487625282.9037310.896050544656.2204998732.34763840.896050544136.7755789185.65816830.7367064976340.2371222375.89500670.90513871190.056432747390.049752536241.1342687560.089428397730.09187985390.9733188935177.7998574224.26978720.7927945162397.29145417.35629330.9519239469Frontier incomplete; useful as high-pressure stress signal.
4tp1_n200_scale0667TP1 N200 scale 0.66712000.6670.6666666667coder_200_ts066715281176200false2002000260.1702760080.2697549478-0.099478939840.58309037060.82367882150.70790987373913.4375264864.7789090.8044430383593.287826737.513780.804443038320.5801453234.563236520.59543455496.71793818120.80398180.8006187940.058370966510.051454318971.134423070.2358945690.25347574960.930639595473.2073116983.62199050.875455263189.2402903183.7269771.030008186Dense-arrival run; Frontier incomplete before lifecycle fix.
5tp1_n200_scale2TP1 N200 scale 2120022coder_200_ts215281200200true20020033430.231341690.2697549478-0.038413257840.59366276550.80298136350.73932321783506.2672794742.536410.7393232178531.5597036718.98148310.73932321789.5953212749.2167670961.04107233877.5034105369.211415951.1198067470.054213625460.049703375191.090743340.066531626460.068633095320.969381114961.4576941255.002487341.117362088174.4840836142.33750871.225847531After Frontier decode-preemption lifecycle fix.
6tp1_n200_scale3TP1 N200 scale 3120033coder_200_ts315281200200true20020020160.21767512780.2697549478-0.052079820070.57397816520.78022655040.7356557723390.006884608.1428430.735655772513.9343094698.6070510.7356557721.0014741161.1661514780.858785616245.946656732.258424471.4243304640.053393334370.046161597141.1566613310.068612546710.07138362960.961180414844.7605814533.212675881.34769573154.5483135122.78871131.258652459After Frontier decode-preemption lifecycle fix.
7tp2_n200_scale2TP2 N200 scale 2220022coder_200_ts269055200200true200200000.26975494780.269754947800.77568235721.2778186830.6070363254581.3041117547.0015910.607036325694.53822581144.146070.6070363250.26909596210.2251191161.1953492316.7446242230.7150717769.4320940220.042955276580.030044996791.4296981580.052887647320.043403823181.21850204626.0512248216.448610071.583794905106.759165172.53471791.471835394Uses true-mixed TP2/TP4 attention profile.
8tp2_n200_scale3TP2 N200 scale 3220033coder_200_ts369055200200true200200000.26975494780.269754947800.68777053211.0880502780.63211282254062.0828066426.1990280.6321128225615.8228567974.22933820.63211282250.13415354950.1535309430.87378835110.57413782180.62704555110.91562378640.039378968490.019057672562.066305230.046707672250.027990820971.66867818221.785964949.9560033742.188223941101.591839353.983486211.881905864Uses true-mixed TP2/TP4 attention profile.
9tp4_n200_scale2TP4 N200 scale 2420022coder_200_ts2177077200200true200200000.26975494780.269754947800.85253379311.5362035370.55496148295035.2009879073.0638840.5549614829763.3502331375.5012850.55496148290.097555150410.17049726190.57218015890.38568723421.4198614080.27163724010.033665850470.016344377352.0597817670.038382656210.028316900261.35546814318.652162829.2608854882.01407984684.9377541443.621889031.947136083Uses true-mixed TP2/TP4 attention profile.
10tp4_n200_scale3TP4 N200 scale 3420033coder_200_ts3177077200200true200200000.26975494780.269754947800.73736651721.2535044930.58824401624355.0046297403.3980960.5882440162660.23060591122.3753880.58824401620.088597491350.1001062780.8850343170.34589546170.31841881011.0862909190.031067781090.0094102842123.3014710710.035782850820.012792766682.7971158816.902919415.549487323.04585242483.0099536527.869075832.978568581Uses true-mixed TP2/TP4 attention profile.