A Decade of Progress on City-Owned Buildings and Facilities

Sami El Sabri is a recent graduate from the University of Toronto’s Faculty of Arts and Sciences, specializing in public health and computational cognitive science. With strong skills in data-driven insights and statistical analysis, Sami is eager to advance public health research. This paper was prepared for Professor Rohan Alexander’s Methods of Data Analysis (STA302) course at the University of Toronto, applying analytical techniques to real-world urban issues.


This post explores the trends in renewable energy installations on City-owned buildings and facilities in Toronto over the past decade. Using data from the City of Toronto’s Open Data Portal, this analysis examines whether the city is on track to meet its commitment to an energy transition. This data is easily accessible at this link and has been analyzed using R and its various statistical analysis packages, along with QGIS for producing map visualizations.

Renewable energy is vital for the environmental health of a city like Toronto. Adopting renewable energy sources reduces greenhouse gas emissions, improves air quality, and fosters a more sustainable urban environment (Perea-Moreno et al., 2018). These environmental benefits are essential for the physical and social well-being of urban populations in the long term. Additionally, the equitable distribution of renewable energy installations ensures that all communities within the city can reap the benefits of cleaner energy sources, which can lead to improved economic opportunities and quality of life for residents (Robinson, 2020). The change must start somewhere, and when local governments take a top-down initiative, they can shift the culture and mobilize public support for renewable energy (van Staden, 2017).

The findings of this analysis highlight a significant increase in renewable energy installations both in quantity and quality, particularly in 2017 and 2018, reflecting the city’s intensified efforts towards its carbon neutrality goals. The data also reveals a diversification in the types of installations and a strategic expansion beyond the downtown core.

Data Source

For this study, I used a dataset from the City of Toronto’s Open Data Portal, specifically focusing on renewable energy installations on city-owned buildings and facilities. The dataset, titled “Renewable Energy Installations,” was last updated on April 22, 2022, and includes data from 2010 to 2019 collected by the Facilities Management Division. It details various types of renewable energy systems installed, such as photovoltaic panels, solar pool heating, and geothermal systems. It also shows if an installation fell under the FIT (Feed-In Tariff) or microFIT programs, which incentivize the use of renewable energy by guaranteeing long-term financing and contracts for both large and small installations, as well as energy storage solutions and Net Metering (NM) installations.

Key Findings

One of the most interesting findings from the data is the variation in the size and type of renewable energy installations over the years.

Figure 1: Size of new City-Owned Renewable Energy Installations in Toronto between 2010 and 2019 with fitted regression line.

Figure 1 illustrates the sizes of new installations over time, showing significant variation, particularly in 2016 and 2018. Due to this inconsistency, a simple trend line cannot reliably predict an overall increase in installation size over the years. When we look closer at the data, we see that the size of installations is closely related to the type of installation. For instance, in 2017, most new installations were smaller MicroFIT systems, which resulted in smaller average sizes for that year (see Figure 2). In contrast, 2018 had more large FIT installations, leading to larger average sizes. This variation highlights how the city is balancing both small-scale and large-scale renewable energy systems to diversify its energy portfolio.

Figure 2: Count of new City-Owned Renewable Energy Installations in Toronto by type of installation between 2010 and 2019

Figure 3 maps the geographical distribution of renewable energy installations across Toronto. Installations are evenly spread throughout the city, with strategic efforts to expand into peripheral areas. Interestingly, most of the city’s larger installations are in these areas, such as North York and Scarborough (Figure 4). As of 2019, there were still notable areas with low installation densities such as Toronto-Danforth, Etobicoke Centre, Scarborough-Rouge Park, and Don Valley City Wards. However, this expansion over time demonstrates a deliberate strategy to ensure equitable access to renewable energy across the city, reflecting a commitment to inclusive and widespread sustainability efforts.

Figure 3: Geographical Distribution of City-Owned Renewable Energy Installations in Toronto between 2010 and 2019

Figure 4: Geographical Distribution of City-Owned Renewable Energy Installations in Toronto by size

Discussion

While this dataset and analysis provide valuable insights into the Toronto City Government’s dedicated efforts to expand and diversify the city’s renewable energy infrastructure, it’s important to acknowledge the limitations. Since the dataset primarily focuses on city divisions and excludes data from other city agencies or public-private partnerships involved in renewable energy initiatives, the analysis may overlook significant installations managed by other entities. However, the approach taken in this analysis can be applied easily to similar datasets, such as considering installation types when interpreting size trends.


Moreover, the data only covers installations up until 2019, leaving out more than five years of potential progress. Toronto’s efforts are part of a broader commitment to addressing climate change, aligning with Canada’s $964 million investment in renewable energy projects under the Paris Agreement. As the largest city in Canada, Toronto stands at the forefront of sustainable urban development with initiatives like SolarTO and the Conservation Authority’s Renewable Energy Program. The city’s Net Zero Carbon Plan aims to achieve net zero emissions in city buildings by 2040. While this goal and current action are commendable, the city must continue or even intensify its efforts to ensure it meets its ambitious targets and sets an example for other cities worldwide. This analysis contributes to a better understanding of Toronto’s commitment to renewable energy, paving the way for informed discussions on the city’s progress and future endeavours in combating climate change.

Sources

Powering open data with civic issues

Hey Reham! Let’s start by telling our readers a little bit about you and your role with Toronto Open Data. I’m Reham Youssef. I’m the Marketing and Communications Lead for the open data team, and I’ve been with the City for over fifteen years but in the last ten years, I’ve really been focusing in on open data. Tell us about the project you’re currently working on. Sure. At open data, we always want to know: what do users want? In the past we would accept dataset requests through email and twitter on an ad-hoc basis. We didn’t really have a concise way of gathering data from the public on what they want when it comes to open data and we’ve been trying to establish different criteria that will help us evaluate and prioritize dataset releases. The Mayor recently announced three new city directives. We took that list and identified two more: climate change, which is an important global issue right now, and poverty reduction, which we heard as a consistent theme from the civic tech community. This gave us a list of five total civic issues we wanted to prioritize dataset releases around: affordable housing, poverty reduction, fiscal responsibility, climate change, and mobility. We’re now considering the impact of these civic issues in determining what we release next. Why is it important to know how to prioritize what datasets get released? It’s just a better way to understand data requests, and we wanted to make sure that incoming data requests align as much as possible with these priorities. We can’t release everything at once. Unfortunately we have limited resources and (wo)manpower, and we also have to work with 44 different City divisions. As you can imagine, there are numerous units within the individual divisions that are responsible for data. When we prioritize data, we link it to the city priorities which you can then narrow down to 5 to 10 divisions, which makes it much easier for us to discuss with those divisions instead of all 44. Often times it’s tricky to ask for data when we don’t even know how or where it exists, let alone if it exists in the first place. So we decided to focus in on the most important things for the city. We want to establish a consensus on what users want, and specifically go after that data. Great! So you started by developing a survey that would be filled out by users to inform what civic issues and datasets they wanted. How did that go? Well, when we first went out the door with the initial civic issues survey, we unfortunately didn’t get what we were looking for the first time. Why is that? There were just too many questions, and we were asking for too much detail at the time. People didn’t understand how to respond. We really wanted people to tell us as much as possible, right down to the data attributes, and exactly what they wanted. We started to see that while people know they want to solve problems like affordable housing, they don’t always necessarily know what data they need to address that problem. So what did you do when you found that users weren’t filling out the civic issues survey as expected? After a month, we took the survey down and switched things around. That’s how we got to the second questionnaire, where we simplified it down to 3 simple questions. The wording had to change for people to think differently, and asking people what they wanted and what they wanted to do with the data was the main driver. The reason we asked for what they will do with the data is because of a fun little algorithm we created that I’ll go into detail later. What was the result? Unexpectedly, we were overwhelmed with responses! 875 in total. We received enough information from our users the second time around that we could plug right into the prioritization framework. This gave each dataset request a score as well as informed us as to how many people were asking for the same data in different words. Can you tell our readers a little bit about the prioritization framework? The prioritization framework was developed by the open data team as a way to assess and prioritize upcoming dataset releases. The algorithm was developed by Ryan Garnett, who manages the open data team as well as the geospatial competency center (Editor’s note: Ryan writes frequently for the Open Data Knowledge Center; you can read his articles here) in the form of a dynamic spreadsheet. The framework focuses on five outputs. Each output is given a specific score. The algorithm applies a weighting to all these scores and then outputs an overall ranking. The primary assessment metrics are:

Source

Is the data in a database somewhere or is it in a spreadsheet on someone’s desktop?

Civic Issue

A dataset request that isn’t related to one of the five main civic issues would receive a lower score than one that does.

Requester

Who requested the dataset? Is it requested by council or a member of the public? Different requesters are given different scoring.

Output

What will it be used for? Education? Media? Government city report? Is it for research? Will it be used to create a by-product, like an app?
So the framework lets us figure out where the priority lies for every request. The higher the number, the more we will focus in on getting that data from the appropriate division or unit. Do you think the scores will be made public anytime soon? They’re already public but they aren’t presently linked to the prioritization framework! We’ve got a few more things to do first. We’re reviewing all of our requests, separating one single request perhaps into two or more data requests, manually tagging them with topics or themes and separating them out by civic issue. We’re actively working on releasing that data to the public in a raw format. It’ll contain every single request with the requester listed. What is the benefit of releasing that data to the public? It’ll be our first actual open dataset from open data. Data about the data! Yes, exactly. This will give people the opportunity to take a look and play around with the data. We’re going to analyze the data and figure out how many requests there are based on tags. We’ll plug each one into the framework, then come up with a score that we then report back to the senior management team (SMT) or appropriate division. That we hope to release soon, publically, and ultimately this is how we want to report back. That seems fairly well-aligned with the mandate of open data, which includes transparency as well as data-informed decision making. That’s right. So what are your next steps as this seems like a fairly mammoth undertaking. We’re going to clean the data for the release. My personal next steps are to perform a little bit of an analysis so we can have a cool data story to report back with. I’m not a data scientist, so I’ve never worked with so many sources before, but I’m excited to venture off and try this. I want to have some sort of summary story that tells us the total number of requests, unique tags, and breakdown of civic issues. Ultimately we want users to understand how we took nearly a thousand requests and categorized them with tags and top civic issues. Thank you, Reham, for explaining the open data civic issues campaign and prioritization framework. We welcome reader comments and questions to opendata@toronto.ca.

Civic Issues Initiative

In this four-part series, we introduce our readers to the Civic Issues campaign. This campaign highlights some of the most important socio-political issues impacting Toronto residents, including hot-button items like housing affordability and poverty reduction. The Civic Issues Initiative survey
What do civic issues have to do with open data? It turns out, quite a bit. It’s important that open data releases reflect the concerns and interests of the city’s residents. Releasing in-demand open data is one way to increase community participation in civic tech, increase data literacy, and activate data-driven decision-making. Solving complex civic problems means ensuring there is a seat at the table for underrepresented voices. Join us as we take you through how we are improving the way open data is created and shared in the city to bring you more of the data you want, when you want it.

How do we currently acquire data requests?

Since the start of Toronto’s Open Data program in 2009, we’ve used many ways to determine what to publish, and when. Some of these ways include keeping tabs on formal and community requests through our e-mail inbox, public consultations, and requests from the media. Once we’re alerted to a dataset request, we connect with the appropriate division to find out if they have the data that’s being asked for, and assess how much effort is involved in acquiring it prior to publishing it. The Open Data team has a highly engaged following on Twitter, which has served as one of the primary ways in which the community can tell us about what they’re interested in. We also track current events and the media to establish the demand for a specific set of related information. We recently launched our monthly newsletter as well, The Open Data Update, which encourages our readers to contact us with requests for data.

Following a request, how is the data acquired?

City divisions like Transportation Services or Parks, Forestry and Recreation can periodically provide us with a ready to go real-time data feed, but not always! A lot of this data is subject to the technical limitations of the time at which it was collected, so much of it is buried. Even when we have access, the formats might be out of date, and there might be issues with consistency. We see these issues in many of the datasets currently hosted on open data. This means that the data would need to be cleaned up, undergo an extensive privacy review, and/or need to be digitized prior to release. It’s a lot of work. There are an estimate 9 petabytes of data in the City, and not all of it can be made open due to privacy, licensing, or technical restrictions either. So it’s essential that we prioritize our releases based on the value their provide to service provisioning.

How does all this relate to civic issues?

As a civic campaign, we’re obligated to demonstrate the socioeconomic value of open data in our reporting. Like other cities, we struggle with demonstrating the true social value of the data that we provide. Many times, we aren’t able to truly demonstrate just how impactful open data can be to a typical resident of the city. It can seem too technical or too bureaucratically inaccessible. How do we change this perspective, and democratize access to open data? Let’s consider some non-technical challenges. Often, the decision-makers in the room don’t usually represent the groups we need to provide services to the most. These are groups with limited data literacy, limited mobility, or economic insecurity. Lived experience is often the best way to understand the unique experience of someone who may be under-housed, or struggle with transit affordability. We need to make sure that we don’t overlook the importance of these communities, and so we want to prioritize the release of data that can positively influence change and provide opportunities for improvement. Evidence shows that decision-making models that involve affected communities and prioritize their needs are typically the most sustainable and scalable.

How do we understand social value?

Let’s pause for a moment and think about a common experience many residents have. Prior to the existence of smartphones, it was difficult to predict transit delays, and commuters had few options outside of waiting. Through access to historical transit data, a frustrated commuter was able to develop an app-based solution that can predict the arrival time of your bus with a high level of accuracy. This example, and countless others from our community, demonstrate the value of open data in case studies that are identifiable to a diverse range of city residents. As such, we have a responsibility to dismantle the barriers that contribute to the under-representation of marginalized communities in civic technology. So how do we truly engage a diverse audience? How do we ensure everyone gets a seat at the table? How do we ensure that we balance feelings with facts to create policies that benefit the residents of Toronto? Simple. We listen. In the interest of working within a data-driven government model, sometimes this will mean delving into the uncomfortable, and being honest and transparent with data that shows us where there’s room for improvement. Open data is also about self-sufficiency. We want to reduce barriers to access. We want anyone who wants our data to be able to use it openly and transparently, whether they’re going to start a business or create a community campaign in support of a social issue they care about. The Civic Issues Initiative survey