Computing Area under the ROC curve #1909
-
Dear MET_help support team, I am using MET to compute the ROC curve and the area under the ROC curve associated with precipitation fields. To do this, I use the "grid_stat" as follows: ./MET_TOOL/10.1.2/bin/grid_stat In the grid_stat_000000L_19700101_000000V_nbrcts.txt file, I found the "PODY" and "POFD" necessary to plot the ROC curve. I was able to do that without problem. However, I was wondering if MET also compute the area under the ROC curve or if I should compute that by myself. In case MET computes that area, how can I found that variable? In grid_stat_000000L_19700101_000000V_nbrcts.txt file, I haven't found anything that makes me think that the area under the ROC curve is computed. Could you help me with this? Thank you very much in advance, Best Regards, |
Beta Was this translation helpful? Give feedback.
Replies: 3 comments 8 replies
-
Hi Diego, And thank you for your question. MET does calculate area under the ROC curve; however, this output is only available in the PSTD line type, so you would need to set up your input forecast data as probabilistic. The variable from the PSTD line type is called Area Under the ROC curve, or AUC; documentation can be found here in the MET user's guide. From the documentation, you can see the equation that MET uses to calculate AUC. You could also get a PSTD line type output from a Stat-Analysis run; looking at the documentation for an |
Beta Was this translation helpful? Give feedback.
-
Hi Diego, I can certainly help with those questions. PSTD is the contingency table statistics for probabilistic forecasts' line type in MET. It contains numerous statistics related to probabilistic forecasts, including the Brier Skill Score (BSS) and the Area Under the ROC curve (AUC). PCT is another line type for probabilistic forecasts, but for what you're trying to accomplish you can ignore it. As I thought of how we can get your forecast precipitation data into a probabilistic range (0 to 1 or 0 to 100) to successfully request a PSTD line type, however, I realized this approach is probably not going to work in MET. There are methods for returning an uncalibrated ensemble probability forecast with the GenEnsProd tool, but none that would let you set thresholds for your forecast data and calculate probabilities for each grid point based on those thresholds. That would most likely require climatology data and a completely different set up. Instead what I'll do is bring in @bikegeek to the discussion who might know a way to calculate the area under the ROC curve in our METcalcpy or METplotpy components. They should also be able to help you past the plotting behavior you're seeing. |
Beta Was this translation helpful? Give feedback.
-
Hi Diego, I'm not exactly sure how this works in METplus, but my understanding is that you have to trick it to get the ROC/AUC by making it think you have probabilistic verification sets. So, I would figure out how it works for those data and try to arrange your data in that manner. Someone else (Barb maybe) might be able to better explain what form your data need to be in than I. There is definitely something awry with your ROC plot as I don't think there should be a loop in it. Best, Eric |
Beta Was this translation helpful? Give feedback.
Hi Diego,
I'm not exactly sure how this works in METplus, but my understanding is that you have to trick it to get the ROC/AUC by making it think you have probabilistic verification sets. So, I would figure out how it works for those data and try to arrange your data in that manner. Someone else (Barb maybe) might be able to better explain what form your data need to be in than I.
There is definitely something awry with your ROC plot as I don't think there should be a loop in it.
Best,
Eric