#+caption: ROC space plot with indifference line.
#+attr_html: :width 300
-[[file:assets/indifference-line.png]]
+[[file:../assets/indifference-line.png]]
An optimal and viable cutoff is the point of the tangent of the left-most parallel line to the indifference line and the ROC curve.
#+caption: Bag of oranges labelled half-time oranges
#+attr_html: :class img-fluid :alt Bag of oranges labelled half-time oranges
-[[file:assets/half-time-oranges.png]]
+[[file:../assets/half-time-oranges.png]]
#+CAPTION: Barefoot & Salty XL Surf balance board
#+ATTR_HTML: :class img-fluid :alt Barefoot & Salty XL Surf balance board
-[[file:assets/balance-board.png]]
+[[file:../assets/balance-board.png]]
Public Sector Network's Innovate WA conference today started with a poll amongst the attendees, asking for our biggest goal or aspiration for the public sector in Western Australia. Overwhelmingly, collaboration came out as the main opportunity for contributors and decision makers in the sector. Closely linked was the desire to better share data between government departments and functions. In his opening address, WA Minister for Innovation Stephen Dawson touched on that, mentioning the State Government is planning to introduce legislation later this year around privacy and responsible data sharing. This will be the first time WA government agencies and state-owned enterprises will be subject to privacy laws, and at the same time is hoped to encourage data sharing that should result in better outcomes for citizens of the state.
#+caption: poll results
-[[file:assets/Innovate_WA_2023.png]]
+[[file:../assets/Innovate_WA_2023.png]]
Greg Italiano, the state government's CIO, gave an update on the digital transformation of the WA government. Delivering a digital identity has been a key milestone so far - no easy task given the many arms of government at state and federal level that were involved. He acknowledged the Service WA app doesn't offer a compelling range of services to deal with government so far though - finding your best deal for refueling and notices on shark detections probably don't top the list of needs for many.
Looking at the number of trips per day on a calendar heatmap reveals something odd - the first week of August has very few rides compared to any other week. While it's known people in NY tend to leave the city in August, this drop is odd.
#+caption: Calendar heatmap of trips
-[[file:assets/NYCtaxitripsNbrPlot.png]]
+[[file:../assets/NYCtaxitripsNbrPlot.png]]
Deciding to ignore August altogether, and zooming in on occupancy rate of the taxis rather than the absolute number or rides, reveals an interesting insight - people travel together far more in weekends and on public holidays!
#+caption: Occupancy heatmap
-[[file:assets/NYCtaxioccupancyPlot.png]]
+[[file:../assets/NYCtaxioccupancyPlot.png]]
Just looking at the calendar heatmap it's possible to determine 1 Jan 2013 was a Tuesday and point out Memorial Day as the last Monday of May, Labour day in September, Thanksgiving day and even Black Friday at the end of November, and of course the silly season at the end of the year!)
#+caption: Perth solar exposure variation
#+ATTR_HTML: :class img-fluid :alt Perth solar exposure variation
-[[file:assets/solarExposure.png]]
+[[file:../assets/solarExposure.png]]
__Table:__ Average solar exposure per m<sup>2</sup> in Kings Park, Perth Jan 2017
to Jun 2023.
#+caption: Group picture with the house manager
#+ATTR_HTML: :class img-fluid :alt Group picture with the house manager
-[[file:assets/group-photo-RMDH.png]]
+[[file:../assets/group-photo-RMDH.png]]
We prepared lunch for the families and volunteers in the house, followed by a tour of the place by its manager. Hearing some of the stories about the children was heartbreaking and uplifting at the same time. During extremely demanding times for a family, both mentally and financially, the volunteers and the team at Ronald McDonald House do a great job in making life slightly easier for them. It was great to be able to contribute a little bit to that this Christmas period.
#+CAPTION: Mindmap on setting up analytics practice
#+ATTR_HTML: :class img-fluid :alt Mindmap on setting up analytics practice
-[[file:assets/Setting_up_an_analytics_practice.png]]
+[[file:../assets/Setting_up_an_analytics_practice.png]]
Ideas courtesy of Abhi Seth, Head of Data Science & Analytics at Honeywell Aerospace.
#+caption: Map of WA state roads
#+ATTR_HTML: :class img-fluid :alt Map of WA state roads
-[[file:assets/state-roads.png]]
+[[file:../assets/state-roads.png]]
Courtesy of the development version of ggplot2 - geom_sf is not yet
available in the version on CRAN.
#+CAPTION: Overall trendline
#+ATTR_HTML: :class img-fluid :alt Overall trendline
-[[file:assets/explore-AU-road-fatalities_files/fatalitiesTrends-1.png]]
+[[file:../assets/explore-AU-road-fatalities_files/fatalitiesTrends-1.png]]
#+CAPTION: Trendlines by Australian state
#+ATTR_HTML: :class img-fluid :alt Trendline by Australian state
-[[file:assets/explore-AU-road-fatalities_files/fatalitiesTrends-2.png]]
+[[file:../assets/explore-AU-road-fatalities_files/fatalitiesTrends-2.png]]
#+CAPTION: Trendlines by gender
#+ATTR_HTML: :class img-fluid :alt Trendlines by gender
-[[file:assets/explore-AU-road-fatalities_files/fatalitiesTrends-3.png]]
+[[file:../assets/explore-AU-road-fatalities_files/fatalitiesTrends-3.png]]
** What age group is most at risk in city traffic?
:PROPERTIES:
#+CAPTION: histogram
#+ATTR_HTLM: :class img-fluid :alt histogram
-[[file:assets/explore-AU-road-fatalities_files/fatalities.cityTraffic-1.png]]
+[[file:../assets/explore-AU-road-fatalities_files/fatalities.cityTraffic-1.png]]
** Hypothesis
:PROPERTIES:
#+CAPTION: densityplot
#+ATTR_HTML: :class img-fluid :alt densityplot
-[[file:assets/explore-AU-road-fatalities_files/fatalitiesSegmentation-1.png]]
+[[file:../assets/explore-AU-road-fatalities_files/fatalitiesSegmentation-1.png]]
#+CAPTION: boxplot
#+ATTR_HTML: :class img-fluid :alt boxplot
-[[file:assets/explore-AU-road-fatalities_files/fatalitiesSegmentation-2.png]]
+[[file:../assets/explore-AU-road-fatalities_files/fatalitiesSegmentation-2.png]]
Some further statistical analysis does confirm the hypothesis!
#+CAPTION: quantile-quantile plot
#+ATTR_HTML: :class img-fluid :alt quantile-quantile plot
-[[file:assets/explore-AU-road-fatalities_files/fatalitiesDistComp-1.png]]
+[[file:../assets/explore-AU-road-fatalities_files/fatalitiesDistComp-1.png]]
Output:
#+CAPTION: Example plot with 2 facets labelled Segment B and Segment A.
#+ATTR_HTML: :class img-fluid :alt Example plot with 2 facets labelled Segment B and Segment A.
-[[file:assets/2016-10-05_R-facet.png]]
+[[file:../assets/2016-10-05_R-facet.png]]
#+CAPTION: seasonal subseries plot Australian births by month 1996-2014
#+ATTR_HTML: :class img-fluid :alt seasonal subseries plot Australian births by month 1996-2014
-[[file:assets/au-births-seasonal-subseries-plot.png]]
+[[file:../assets/au-births-seasonal-subseries-plot.png]]
In preparing the graph above (a "seasonal subseries plot"), I could not
help but notice the spike in the numbers for each month around 2005. It
#+CAPTION: Table with evolution of house prices
#+ATTR_HTML: :class img-fluid :alt Table with evolution of house prices
-[[file:assets/house-price-evolution-table.png]]
+[[file:../assets/house-price-evolution-table.png]]
Of course, the AUD/EUR exchange rate needs to be taken into account as
well. I've adjusted the prices using the weighted average monthly
#+CAPTION: Plot with evolution of house prices in EUR
#+ATTR_HTML: :class img-fluid :alt Plot with evolution of house prices in EUR
-[[file:assets/house-price-evolution-plot.png]]
+[[file:../assets/house-price-evolution-plot.png]]
Comparing the first 3 months of 2021 to the first 3 months of 2023, the
relative price of an Australian residential dwelling has gone to 92% of
#+CAPTION: Actual vs predicted loan vintage performance.
#+ATTR_HTML: :width 400 :class img-fluid :alt Actual vs predicted loan vintage performance.
-[[file:assets/rollRateBeware.png]]
+[[file:../assets/rollRateBeware.png]]
The solid lines are the result of applying an average transition matrix
6 times (the model's predicted outcome). The dashed lines are the actual
#+CAPTION: CRISP-DM
#+ATTR_HTML: :class img-fluid :alt CRISP-DM
-[[file:assets/crisp-dm-diagram.png]]
+[[file:../assets/crisp-dm-diagram.png]]
It doesn't put too great an emphasis on scheduling the activities, but
focuses on the value creation.
#+CAPTION: OODA loop ([[Image by Patrick Edwin Moran - Own work, CC BY 3.0|https://commons.wikimedia.org/w/index.php?curid=3904554]])
#+ATTR_HTML: :class img-fluid :alt OODA loop
-[[file:assets/OODA-diagram.png]]
+[[file:../assets/OODA-diagram.png]]
#+CAPTION: Pupil density in Western Australian high schools
#+ATTR_HTML: :class img-fluid :alt Pupil density in Western Australian high schools
-[[file:assets/SLIP_WA_schools.png]]
+[[file:../assets/SLIP_WA_schools.png]]
#+CAPTION: Income distribution in major AU cities
#+ATTR_HTML: :class img-fluid :alt Income distribution in major AU cities
-[[file:assets/AUCitiesIncomeDistribution.gif]]
+[[file:../assets/AUCitiesIncomeDistribution.gif]]
The full code is available on my git server for you to clone using
=git clone git://git.vanrenterghem.biz/R/project-au-taxstats.git=.
#+CAPTION: Survival/incidence curve and risk table
#+ATTR_HTML :width 442 :class img-fluid :alt Survival/incidence curve and risk table
-[[file:assets/surv-curve-risk-table.png]]
+[[file:../assets/surv-curve-risk-table.png]]
Underneath the plot, a risk table is added with no effort by adding
=risk.table = TRUE= as parameter for =ggsurvplot=.