# Part II - The nature of data

> *If you don't ask the right questions, I can't give you the answers and if you don't know the right question to ask, you're not ready for the answers.*
>
> **Ed Parker**

### Data everywhere <a href="#id-133ccca4-bc7a-4a4c-90a4-e4a57df19183" id="id-133ccca4-bc7a-4a4c-90a4-e4a57df19183"></a>

It is understood that Information Theory started in 1948 with Claude Shannon's “A Mathematical Theory of Communication”. Data wasn’t born that day though sure enough from then on we experienced an exponential growth in computing power, communication capabilities and relentless data ingestion.

Data is necessary and can’t be stopped. Wearing glasses signals some sort of eye problem. Sharing a meal with others, tells you about their taste in food and possibly other personal preferences. We emit and receive data all the time and this is really necessary or otherwise we wouldn’t be able to make sense of our environment and orient ourselves through it. All of our decisions, while sometimes not properly informed, are data-driven.

And so the question begs being asked: ***what is data actually***?

### Defining "data" <a href="#id-03208a70-254e-4d5d-b594-ad1a7b10914c" id="id-03208a70-254e-4d5d-b594-ad1a7b10914c"></a>

I often feel that the current common perception of data is something along the lines of the magical dust floating in Middle Earth and that only some wizards can channel it through their arcane knowledge, mix it with other obscure elements and produce an otherworldly outcome to the marvel of us mere mortals.

Well, nothing further from the truth. Human-quantified information is only useful to humans and this only if it is sufficiently contextualized. In other words, by itself number 75 means nothing and thus has no value. Now, if we are to establish that it’s the number of beats per minute from my heart then we can suspect that, in general, I am healthy.

Here’s the important part: what gave meaning and value to that simple piece of data was to contextualize it sufficiently. That and only that.

Over the past decades a number of interpretations about the nature of data have been the center of rather heated debates. We live under a cocktail of them and this is making it more difficult to address all the malpractices we are too used to already.

* The *Data as IP* approach, considers that “your data is like a song and if someone else is whistling it you should get paid for it”. The problem? This view detaches us emotionally from our data, making it nearly impossible to feel compelled to protect it unless for commercial reasons.
* The *Data as Labor* viewpoint establishes that everytime we interact with technology, say posting a picture or writing a restaurant review, work is generated. This keeps opening the door to deceptive ideas that we can and should monetize our data.
* Others prefer to observe *Data as Infrastructure*, arguing that data is only a means to build services and products. Again, a position where we see data as an external tool that has nothing to do with us.

There is however a more straightforward way to understand data that in fact consolidates all the above propositions and gives us a sense of how misusing data leads to harms: ***You are your data***.

It’s really that simple. All the data collected about you, creates models of yourself.&#x20;

Data, when properly organized using schemas, generates models that represent you. We call those digital twins.

When you look at data as an extension of yourself, concepts such as consent or data trafficking feel totally different. We may look deeper into all these concepts in future articles.

Some will quickly jump and argue *“but that opens the door to selling your data!”*.

Well… yes but no:

* The recognition of ownership does not imply the ability to sell something. I own my brain and yet can’t sell it.
* We sell ourselves for work on a daily basis. Recognizing the intimate relationship between us and our data would allow better regulation to minimize abuses.

Should the nature of data matter to you? For one thing, **it is you**. That’s precisely what big tech and all authoritarian governments have understood long ago and why they hoard your data. Or consider any of the nowadays all-too-common data breaches exposed worldwide. In the alleged JPN breach, would you say it would be data from 4 million Malaysians being circulated or rather 4 million Malaysians being exposed naked?

We hear often enough that you can’t get the right answers if you don’t ask the right questions. It seems to me that we asked the wrong questions when we started massively collecting data without understanding its true nature and that along the way we seriously turned a blind eye to the right questions because we were not ready for the actual answers.

A parting thought: *Information is power* and we currently do not control in effective ways who has access to it; essentially because we grew detached from it.

### So what’s next? <a href="#bef13c5f-5097-4bdb-a899-2c80f724f536" id="bef13c5f-5097-4bdb-a899-2c80f724f536"></a>

Better understanding the nature of data allows us now to dive into who should be taking care of what in the next episode.

<table data-card-size="large" data-view="cards"><thead><tr><th></th><th data-hidden data-card-cover data-type="files"></th></tr></thead><tbody><tr><td><p>Jean F. Queralt (John) is the Founder and CEO of <a href="https://TheIOFoundation.org">The IO Foundation</a>, a tech nonprofit advocating for <a href="https://TIOF.Click/DCDRAbout">Data-Centric Digital Rights</a>.</p><p></p><p>Disturbed by the level of intrusion of technology in the lives of citizens, he took the leap in 2018 of starting The IO Foundation to establish a more solid and targeted direction to address users' protection from a technical standards perspective.</p><p></p><p>He is actively involved in Standard Developing Organizations such as the <a href="https://ITU.int">ITU</a>, <a href="https://IETF.org">IETF</a> and <a href="https://ICANN.org">ICANN</a>.</p><p></p><p>Because he regards technologists as the <a href="https://TIOF.Click/TIOFNextGen">next generation of rights defenders</a>, he works to raise awareness on the importance of these organizations across the technical community and facilitates their participation in them.</p></td><td><a href="/files/hoi82VoZrEmCNWFiWKTT">/files/hoi82VoZrEmCNWFiWKTT</a></td></tr><tr><td><a href="https://TheIOFoundation.org">The IO Foundation</a> (TIOF) is a global nonprofit advocating for Data-Centric Digital Rights, born out of a fundamental concern on the future of digital communities, both in their governance and implementation. TIOF aims to provide platforms to raise awareness on Digital Rights as well as effective solutions to ensure that they are adequately observed. For more information, please visit <a href="https://TheIOFoundation.org">www.TheIOFoundation.org</a> and reach out via <a href="mailto:Contact@TheIOFoundation.org">Contact@TheIOFoundation.org</a>.<br><br><br><br><br><br><br><br><br><br></td><td><a href="/files/ihBYQ1xovcJpfG2MbZy4">/files/ihBYQ1xovcJpfG2MbZy4</a></td></tr></tbody></table>


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://discover.theiofoundation.org/publications/articles/a-penny-for-your-bytes/part-ii-the-nature-of-data.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
