Thursday, October 30, 2025

Interview with Prof M Vidyasagar, FRS Part 1

 The first part of my interview with Prof M Vidyasagar, FRS has appeared today in Rediff and the second part will appear soon.

https://www.rediff.com/news/interview/india-builds-best-when-it-builds-alone/20251029.htm )

Why The Americans Were Opposed To Agni

Last updated on: November 06, 2025 12:38 IST


'The heat shield technology for re-entry vehicles was first mastered in DRDO for the Agni missile.'
'This is why the Americans were so opposed to Agni in the 1980s, unlike other missiles -- it was a re-entry vehicle.'

IMAGE: The Defence Research and Development Organisation successfully flight tests the new generation Agni P ballistic missile, in Balasore. Photograph: ANI Photo
 

Professor M Vidyasagar, FRS, is a Distinguished Professor at IIT Hyderabad. He earned his BS, MS, and PhD in electrical engineering from the University of Wisconsin.

His distinguished career includes academic positions at universities in the USA and Canada, followed by leadership roles as Director of India's Centre for AI and Robotics and Executive VP at Tata Consultancy Services.

He held a chaired professorship at the University of Texas in Dallas before his current position.

IMAGE: The LCA Tejas Mk1A takes off on its maiden flight at Hindustan Aeronautics Laboratory in Nashik, October 17, 2025. Photograph: ANI Photo

A major feather in the cap of his team at the Centre for AI and Robotics is the creation of world class digital control software for the Light Combat Aircraft Tejas indigenously.

His research interests span stochastic algorithms, convex/nonconvex optimization, reinforcement learning, and machine learning, with recent work focusing on the theoretical foundations of stochastic gradient descent and Large Language Models.

Professor Vidyasagar, a prolific author of 13 books and over 160 peer-reviewed research papers, was appointed a Fellow of IEEE (the largest professional engineering body in the globe) for his advanced contributions to Control Systems at a very young age.

His groundbreaking contributions have been recognized with numerous prestigious honours, including Fellowship of The Royal Society (FRS), the IEEE Control Systems Award, the Rufus Oldenburger Medal (ASME), and the John R Ragazzini Education Award (AACC).

He is also a Fellow of multiple national academies in India and a recipient of a SERB National Science Chair.

"I believe a significant handicap is that the critical need for our own technology to win wars has never been burned into the memory of our government and armed forces. But let's look at the positive achievements. The nuclear submarine, the aircraft carrier, the Tejas fighter, and the guided missiles programme -- including newer missiles and the excellent BrahMos joint venture -- are all visible, major system successes," Professor M Vidyasagar tells Shivanand Kanavi. The first of a two-part must-read interview.

A major feather in the cap of his team at the Centre for AI and Robotics is the creation of world class digital control software for the Light Combat Aircraft Tejas indigenously.

His research interests span stochastic algorithms, convex/nonconvex optimization, reinforcement learning, and machine learning, with recent work focusing on the theoretical foundations of stochastic gradient descent and Large Language Models.

Professor Vidyasagar, a prolific author of 13 books and over 160 peer-reviewed research papers, was appointed a Fellow of IEEE (the largest professional engineering body in the globe) for his advanced contributions to Control Systems at a very young age.

His groundbreaking contributions have been recognized with numerous prestigious honours, including Fellowship of The Royal Society (FRS), the IEEE Control Systems Award, the Rufus Oldenburger Medal (ASME), and the John R Ragazzini Education Award (AACC).

He is also a Fellow of multiple national academies in India and a recipient of a SERB National Science Chair.

"I believe a significant handicap is that the critical need for our own technology to win wars has never been burned into the memory of our government and armed forces. But let's look at the positive achievements. The nuclear submarine, the aircraft carrier, the Tejas fighter, and the guided missiles programme -- including newer missiles and the excellent BrahMos joint venture -- are all visible, major system successes," Professor M Vidyasagar tells Shivanand Kanavi. The first of a two-part must-read interview.

IMAGE: DRDO successfully conducts the maiden flight test of the Integrated Air Defence Weapon System off the coast of Odisha. Photograph: @rajnathsingh X/ANI Photo

Thank you for your time, Professor Vidyasagar. There are many diverse things I want to talk to you about in which you have great expertise and experience. To begin with, what have been the major achievements of the Defence Research and Development Organisation (DRDO)?

DRDO has done extremely well in areas where it hasn't had to compete with the 'import option'.

Take missiles, for example. If you want a missile, you have to build it yourself; nobody will sell it to you.

This is also how ISRO succeeded -- you can't import a satellite launch vehicle.

In such segments, DRDO and ISRO come out looking very good. They have good scientists, but they also aren't constantly watching their backs against import lobbies.

Take for instance, the ring laser gyroscope developed by DRDO under the Integrated Guided Missile Programme.

Older missiles used mechanical gyroscopes; the new laser gyroscope is far more accurate.

Nobody would give that to us, so DRDO developed it indigenously.

IMAGE: The indigenous MBT Arjun MK-1A tank. Photograph: DRDO/ANI Photo

However, in areas where an import option exists -- like fighter jets, main battle tanks, or towed artillery guns -- the moment DRDO gets close to a solution, someone shows up offering to sell it.

The sellers also have an interest in undermining DRDO.

What I've never understood is what's in it for the buyers, our government of the day and armed forces?

I believe a significant handicap is that the critical need for our own technology to win wars has never been burned into the memory of our government and armed forces.

But let's look at the positive achievements.

The nuclear submarine, the aircraft carrier, the Tejas fighter, and the guided missiles programme -- including newer missiles and the excellent BrahMos joint venture -- are all visible, major system successes.

There are also numerous critical subsystems developed that aren't as visible.

For example, the heat shield technology for re-entry vehicles was first mastered in DRDO for the Agni missile.

This is why the Americans were so opposed to Agni in the '80s, unlike other missiles -- it was a re-entry vehicle.

We also had to master the technology of mounting printed circuit boards in these missiles that can withstand extreme shock, vibration, and temperature.

The US had a far better ecosystem for spinning off military technology for civilian use --Teflon came from the space programme.

DRDO could have done this, but it was stifled by financial and ideological hurdles.

When Dr Kalam was director general of DRDO, we explored commercialising DRDO technology.

The problem was the finance ministry was terrified that an industry might buy a license and make a fortune.

Our attitude was DRDO has no mechanism for marketing, scaling up, or paying competitive salaries.

If we don't commercialise, the technology will die. Let someone take a chance and commercialise it.

But the officials were scared of audit objections from the CAG.

Their solution was to ask for a ridiculous amount of money upfront to avoid any future allegation of underpricing.

IMAGE: The Mounted Gun System indigenously developed by involving Indian industry, defence public sector units and academia, under the leadership of DRDO, displayed at the Vehicles Research and Development Establishment in Ahmednagar. Photograph: Video Grab/ANI Photo

If we had adopted a model of taking a small royalty or an equity stake in the commercialising company, many small technologies could have proliferated into society.

Unfortunately, that model was unthinkable in the '90s. This resistance has prevented the diffusion of many technologies.

Fortunately, some groups with deep pockets like the Tatas can take chances.

For instance, some drones used abroad are actually made by a small, young entrepreneurial company in Hyderabad -- not a big corporate.

For example, armour-penetrating explosives could have had mining and other civilian applications. Similarly the laser gyro, heat shield etc.

The commercialisation never took off because our finance people were scared to make decisions.

I don't necessarily blame them; they fear audit objections. The whole system is messed up.

This is also why government startup funds often go unspent -- no one wants to take the risk inherent in venture capitalism, where you expect many failures for one big success.

This culture of audit objections stifles innovation, and no government, including the present one, has shown the willingness to truly understand and fix this problem.

You were the founder director of the Centre for Artificial Intelligence and Robotics (CAIR) at DRDO. Please tell me how that came about and what was achieved there.

It was an interesting time.

I was a professor at the University of Waterloo, Canada; at the top of my academic career, but my wife and I always wanted to return to India.

In 1987, I took a sabbatical and met Dr Kalam at DRDL (Defence Research and Development Laboratory) in Hyderabad.

I was struck by how hard everyone was working -- a testament to the impact of a charismatic leader.

I told him I was thinking of returning to India, and he said he'd look for an opportunity.

In September, just two months later, he told me that Dr V S Arunachalam, the scientific advisor to the defence minister, wanted to start a project on AI but couldn't find a leader.

He asked if I would lead it. I was interested but had to finish guiding my four PhD students in Waterloo.

I asked for a year's joining time, which was agreed upon.

When I joined DRDO, the 'CAIR Project' had a two-year lifetime in government terms.

My first task was to turn this project into a permanent establishment -- a laboratory.

This required navigating bureaucracy and defining what we would do.

I was met with tremendous goodwill, primarily because in 1989, the idea of a well-established academic returning to India was almost heretical. It rarely happened.

We defined our areas: In artificial intelligence, we started with rule-based expert systems and quickly moved to neural networks, which had just emerged in 1986.

In robotics, we coordinated with BARC's (Bhabha Atomic Research Centre) strong Division of Remote Handling and Robotics, led by the amazing Ram Kumar.

We decided that anything in a radioactive environment would be handled by BARC, and anything outside that would be our domain.

We were primarily an internal consultant for DRDO, not expected to build finished products but to be an in-house resource.

For example, we built a fault detection expert system for a radar being developed by LRDE.

We also developed a robotic system to uniformly coat the canopy of the Light Combat Aircraft with radar-absorbing material -- a critical stealth technology, as the canopy is the most reflective part of an aircraft.

We also did basic research. I probably had the highest percentage of PhDs in any DRDO lab.

By 1994, we had about 40 scientists and 80-85 people total, including support staff.

To get hiring flexibility, we set up a non-profit society called the Institute of Robotics and Intelligent Systems (IRIS), with the DRDO secretary as the ex-officio chairman.

This allowed us to hire on contract and pay somewhat higher salaries.

Our biggest success was the control law for the LCA.

It started around 1992. Girish Deodhare, who just retired as DG of ADA, was my PhD student at Waterloo and joined CAIR.

Dr Kota Harinarayana was the programme director of ADA.

Initially, Martin Marietta was supposed to design the control software. They would talk big but belittle Indian capabilities and refused to share design documents, only giving final numbers.

Dr Kalam, who took over as DG in July '92, called a meeting -- on a Sunday -- and appointed Professor Roddam Narasimha FRS, to head a committee to decide if we could build the control law indigenously. We all said we could.

These foreign companies weren't impressive technically and weren't offering a knowledge transfer.

We decided to go for a state-of-the-art digital fly-by-wire control with a 32-bit floating-point processor, which was advanced for the time.

Professor Narasimha recommended we do it ourselves, and Dr Kalam agreed. He appointed Professor I G Sharma of IISc as an independent assessor to report to the ADA governing council every three months.

We designed the first cut of the control law in about two years.

A major hurdle was computing power. Due to international restrictions, we couldn't get powerful computers in India.

We rented time on a British Aerospace flight simulator in the UK.

Our team would go there with the software on tapes, and our test pilots would fly the LCA control law on that simulator.

Interestingly, the BAE guys, who were also working on the Eurofighter, invited our pilots to try their simulator.

Our pilots later reported that they found the LCA's handling qualities to be better!

The control law's quality is best attested by the fact that the LCA is the first fighter aircraft in the world to be inducted into service without a single crash during its development phase.

Because it was completely in-house, and the test pilots who flew it eventually rose to senior decision-making positions in the air force, the system was well-accepted even though the import option always loomed.

The economics also flipped. In 1989, the assumption was that the LCA would be three times more expensive than a Mirage but would be ours.

Twenty years later, it was not only ours but also only 25%-30% of the cost of any imported equivalent fighter alternative.

There was an export market for an affordable, capable fighter, but we lacked the production capacity and the mindset to engage a private sector partner for mass production and marketing.

The ingrained mindset, fearing 'someone might make money', prevented this.

HAL's production rate was initially 6 per year, then 18 -- we need 50 to 60 per year to meet demand and export.

We see the same issue today with the Advanced Medium Combat Aircraft (AMCA).

The project was 'sanctioned' two years ago, but not a single rupee has been released.

Most people don't know that government sanction means expenditure power for when money is released, not that funds have been allotted.

Many times I doubt if our armed forces are serious about wanting indigenous equipment; they seem to delay until they can claim an urgent need to import.

Countries like the US field-test their equipment in local wars in the globe, creating testimonials for their technology.

In India, field trials are often used as an excuse to delay induction.

-----------------------------------------------------------------------------------------

Shivanand Kanavi, a frequent contributor to Rediff, is a theoretical physicist, business journalist and former VP at TCS.
He is the author of the award winning book Sand To Silicon: The Amazing Story Of Digital Technology and has edited Research By Design: Innovation and TCS; a chronicle of over 30 significant outcomes and case studies of TCS R&D from 1981-2006.

Feature Presentation: Ashish Narsale/Rediff

Wednesday, October 29, 2025

First P.S. Deodhar Memorial Lecture by Shivanand Kanavi

 

Can we develop world class technology in India, for India and the world?


September 25, 2025, at Vidyalankar Institute, Mumbai

(Edited excerpts of the talk)



P S Deodhar (Sept 25, 1934 - Jan 28, 2024)

First, I want to thank two institutions for inviting me. One is the National Centre for Science Communicators, and my friends Suhas, Dr. Barve and Anant Deshpande for the honor of delivering this first memorial lecture for P.S. Deodhar. I also thank Dr. Ashish Ukidve and the Vidyalankar Institution—its faculty and trustees—for hosting this interaction with you all who are the future of Indian technology.

You are that future. You have already heard much about Deodhar ji. I never had the chance to meet him in person.

I first heard of him around 1977-78. Two of my friends, from Cornell and Carnegie Mellon, also returned from the US with me and took their first jobs at Aplab. That was my introduction to Aplab, a creation of P S Deodhar in Electronic Manufacturing.

Much later, about 10-15 years ago, Mr. Deodhar read my book “Sand to Silicon: Amazing story of Digital Technology”, and sent me an email. We exchanged a few messages, and around 2013, he called up and spoke to me about his book on China. That was the extent of our personal contact.

I advise you all to look him up on LinkedIn, where his professional contributions, books, and a comprehensive list of his innovations are detailed.


Shivanand Kanavi delivering the First P S Deodhar memorial Lecture Sept 25, 2025

Since you have already heard about him from those who knew him well, I will not repeat it all. Briefly, his professional contributions spanned 50 years, from 1962 to 2012. From 1985 to 1989, he was the head of the Electronics Commission, the apex body for electronics and IT policy at the time—a role very different from today's scattered policymaking.

He was also a member of the Broadcast Council (1992-93), whose recommendations helped liberalize India's broadcast services, moving beyond just state-run Doordarshan and Akashvani and paving the way for private channels.

His association with the Marathi Vidyan Parishad lasted from 1999 to 2014. He was also a keen observer of China, serving as Chairman of the India-China Economic and Cultural Council from 2004, which informed his book, Sina Sthana.

He wrote three books: Capital Punishment (1993), a pun on his experiences in Delhi; The Third Parent (1995), on broadcast reforms; and Sina Sthana (2013), on China. He also wrote a four-part autobiographical series, "Problems of Being an Indian Entrepreneur," published in Money Life magazine.

His innovations were primarily in power electronics and telecommunications, including pioneering work on an Indian ATM and the concept of smart cards—the technology behind your credit cards, driver's licenses, and FastTags.

He designed a smart card-enabled telephone. To your generation, this might not seem impressive, as you've lived your entire lives after the digital revolution. To understand a revolution, you must know what came before.

When I went to the US, to call home, I had to go to the central telegraph office, book a "lightning call," wait for hours, and pay 160 rupees for three minutes of poor connection. You live in the era of free WhatsApp video calls.

For us, email was a revolution. In 1993, when I was working in Business India, they provided me with the first email services. It was text-only, as the web didn't exist in India until VSNL introduced it in 1995. We used modems with speeds of 2.4 kilobytes. We still relied on fax machines, and our telephone bills were enormous.

When I traveled from my hometown, Dharwad, to IIT Kanpur, in 1972-74, it involved four train changes. Securing a reservation was a lottery. Today, you book everything instantly on your phone via IRCTC app. This is the revolution you were born into.

The days when pioneers like Mr. Deodhar dared to design and build electronic systems in India were very different. Electronics was nascent globally, but India also lacked foreign exchange to import anything. The challenge was: if we can't import modern technology, we must build it ourselves. That was their determination.

The challenges of the 60s and 70s were about self-reliance. The challenges of the 90s, during a global wave of liberalization, were different. Today's challenges are again different.

Look at the world's most valuable companies today. NVIDIA, an electronics chip company, has a market capitalization of $4.3 trillion—more than India's GDP. Apple is another $ 4 trillion company. Apple designs its products, TSMC makes the chips for Apple, and Foxconn assembles Apple’s iPhones and iPads. Foxconn's market value is around $50 billion.

In India, we celebrate assembling iPhones and creating thousands of jobs. But the value addition is only 5-6%. We are happy with that 5%. Foxconn, the assembly giant, sets up plants here, and we celebrate it. Tata Group is now trying to build the entire chain from chips to assembly, but this will take 5-10 years.



Modern manufacturing is not manual labor; it's highly automated "smart manufacturing." You, the future engineers, will be part of this ecosystem—managing the IT systems, data centers, cybersecurity, and communication links that power it.

So, can India produce technology for itself and the world? History shows that until about 1600, India and China led the world in ideas and technology. We had advanced zinc and steel metallurgy, and superior forged cannons in the 15th and 16th centuries.

Technological development often happens despite government policy, not because of it. The Indian IT services industry is a classic example; it grew under the radar, solving problems and finding its own way.

What Indian IT achieved, starting with TCS and followed by others, like Wipro ad Infosys, HCL etc was the industrialization of software development. We moved it from an artisan's craft to an engineered process, breaking down complex projects into standardized tasks managed by different teams and then integrate all the components and subsystems seamlessly into a solution. It is this that the whole world now comes to India for.

Have Indians achieved excellence in modern technology in the 20th century? Let us look at a few cases.

Consider Jagdish Chandra Bose. In the 1890s, in Calcutta, he developed components for radio and pioneered solid-state electronics, filing a patent in 1901. But who commercialized radio and won the Nobel Prize? Guglielmo Marconi.

This pattern repeats. We can develop technology, but often fail to productize it. Look at all the foundational technologies of modern life—the telephone, telegraph, computer, internet, car, plane, mobile phone—none were invented here. We missed the Industrial Revolution.

But in the digital age, Indians have made seminal contributions, though often abroad:

·         Narendra Singh Kapany: Invented "fibre optics" in the 1950s.

·         Bishnu Atal: Developed Linear Predictive Coding, used in every mobile phone.

·         Arun Netravali: A key figure in the development of HDTV.

·         Yogen Dalal: Co-developed the TCP/IP internet protocol.

·         Sanjay Mehrotra: Co-Invented the thumb drive.

·         Rajiv Motwani: Was a key mentor and collaborator to Google's founders; his work is foundational to their search algorithm.

·         Bala Manian: Received a Technical Oscar for his contributions to the technology used in Hollywood's computer-generated graphics.

·         Sam Pitroda: His work on digital switches led to the creation of C-DOT, which digitized India's telecom network.

The point is, there is nothing wrong with the Indian brain. We are fertile and inventive. We must build the ecosystem and encourage innovation and risk taking in technology.

We should not overestimate our achievements and start declaring we are a tech power etc. That is nonsense, but don't suffer from an inferiority complex either.

The above examples were mostly work done by Indians abroad. Has India is also developed world-class technology at home?

Let us see some examples:

·         UPI: The digital payment system developed by the National Payments Corporation of India (NPCI) is now studied globally.

·         ISRO: Its satellite launch services (PSLV, GSLV) and remote sensing capabilities are world-class and hundreds of foreign satellites have been launched by ISRO besides satellites made in India.

·         Pharmaceuticals: Companies like Dr. Reddy's, CIPLA, Lupin and others fought patent cases globally, proving their chemical processes were cheaper and better, thereby establishing Indian pharma.

·         The National Stock Exchange (NSE): Its trading platform, designed by Indian engineers, was more efficient than NASDAQ's and attracted delegations from around the world who came to study it

·         Guided Missiles: The Integrated Guided Missile Programme (IGMP), started in the early 80s under Dr. A P J Abdul Kalam, developed guided missiles like Agni and others.

·         Fast Breeder Reactor: The 500 MW reactor in Kalpakkam, once commissioned, will be a major breakthrough in nuclear power, producing more fissile material than it consumes.

My final point is this: many critical technologies cannot be bought at any price. No one will give us the technology for jet engines for our fighters ; we must develop them ourselves. The Gas Turbine Research Establishment in Bangalore has worked hard, progressing from 50 kN to 70 kN, but we need 100 kN to power next-generation fighters like the Tejas. The Chinese, too, depended on Russian engines but are now developing their own.

So, can Indians create technology for India and the world? The answer, based on our history and present, is a resounding yes. IT services, satellite launches, UPI, generic drugs, and the fast breeder reactor are just a few examples. These are major technological developments that nobody would simply give us. We had to, and will have to, develop them ourselves.

Thank you.

 

Thursday, October 9, 2025

'A Revolution In Indian Languages Is Waiting To Be Unleashed'

'A Revolution In Indian Languages Is Waiting To Be Unleashed'

By SHIVANAND KANAVI

( https://www.rediff.com/news/interview/mission-bhashini-must-read-interview/20251003.htm )

October 03, 2025

'The Bhashini Mission has delivered a working technology at large scale, which is as good as or better than the one with MNC tech giants.'

One of the dreams of the founders of Artificial Intelligence who met in Dartmouth College in New Hampshire, USA, in 1956 and produced a Manifesto of AI was speech to speech translation using computers.

Are we close to achieving it today? Will Mission Bhashini be the answer?

Professor Rajeev Sangal, a pioneering computer scientist, former director of IIT Varanasi and founder director of IIIT Hyderabad, offers a masterclass on AI and language technology.

A distinguished alumnus of IIT Kanpur and the University of Pennsylvania, Professor Sangal is a world-renowned expert in computational linguistics, best known for his groundbreaking work on the Computational Paninian Grammar framework for Indian languages.

He conceived Mission Bhashini and continues to guide it, as the founding Chair of the mission's executive committee.

He provides a rare, behind-the-scenes look at the conception and execution of India's ambitious Bhashini Mission for speech-to-speech translation, detailing its visionary strategy, the ethical dilemmas of AI opacity, the future of capturing linguistic nuance, and a strategic roadmap for India to achieve global leadership in domain-specific AI applications, making it an essential read for anyone interested in technology, governance, and innovation.

"Many people thought there is no way India can catch up with the MNC tech giants. These sceptics did not realise that Indian academia had the technical know-how, of building prototype models," the distinguished professor tells Shivanand Kanavi in a must read conversation.

 

Photograph: Kind courtesy bhashini.gov.in

How was Mission Bhashini conceived?

Mission Bhashini was thought of by the Prime Minister's Science Technology and Innovation Advisory Council (PM-STIAC).

I was approached by Professor K Vijayraghavan, then the chairman of the council, in September 2018, asking me if language translation could be addressed by technology, and to draw up a plan for it, particularly for S&T (science and technology) content in English.

I was happy that language technology was coming at the forefront of national priorities.

I had demonstrated the machine translation technology we had developed to our PM in February 2016 at BHU, when I was director of IIT Varanasi.

How did you go about conceptualising the mission?

When I was thinking about the mission, I had to look at the current situation -- the state of the technology, access to devices by people, and their needs that could be fulfilled.

By now, smart phones had come in the hands of a large population, they were wanting to access content in their own languages on the Internet.

At that time, the volume of Indian language content was in even smaller quantities than today.

Even today the total content in all the Indian languages put together is less than 0.1% of the content on the Internet. Yes, less than 0.1%!

So, providing English content translated in local languages to the common man would become desirable if the technology could be made ready.

The prevailing mindset at the time of conception of Bhashini in 2018-2019 was that India did not have a demonstrated working machine translation much less any speech to speech machine translation system. Many people thought there is no way India can catch up with the MNC tech giants.

These sceptics did not realise that Indian academia had the technical know-how, of building prototype models.

This was a result of research and development of the past 30 years with government funding.

Now, it was a matter of rebuilding those systems using the latest tools and approaches, and engineering the models for large scale use.

India had also gained experience in building Aadhaar and UPI. What is today known as Digital Public Infrastructure.

With the above capability and experience, it is no wonder that the Bhashini Mission has delivered a working technology at large scale, which is as good as or better than the one with MNC tech giants.


IMAGE: Professor Rajeev Sangal with Prime Minister Narendra Modi.

What were the key ideas in the conception of the mission?

One had to work out the scope, tasks, and types of uses. I felt that we should take speech to speech machine translation (SSMT), and not limit ourselves to text to text machine translation (MT).

This might look like a simple expansion of the scope, but researchers in MT and in speech processing are quite separate -- they work separately, and were often located in two different departments.

Would it be possible to make them work together, towards a single goal?

A workshop was organised with leading researchers from both the areas in January 2019 at IIIT Hyderabad, to discuss possible approaches to be taken in the mission.

Consulting colleagues from both the areas, I felt confident that there was a willingness to work together.

I knew that already there was a high level capability in both the areas in India.

It is a testimony to the strength of Indian academia and proper governmental funding in the past under the Technology Development for Indian Langauges (TDIL) programme of Meity (Ministry of Electronics & IT).

It is a testimony to the strength of Indian academia and proper governmental funding in the past, even though in the recent past, there was a lull in funding.

I decided to take the plunge for Speech to Speech technology !

Educational courses on NPTEL/Swayam, besides websites, were the prime targets identified to be used for training in Machine Learning.

It would allow students across the country to access content in higher education in their own languages.

Moreover, translation of formal lectures would be easier than conversations, because conversations use very short or partial sentences, and are highly contextual.

Complex technologies like speech processing and text to text translation make errors. So the system was designed to take human inputs as well, including corrections.

It would normally function as a human machine combination, although as its quality improves it could also be used in a fully automatic mode.

Finally, it was also decided that all 22 official Indian languages together with English would be covered.

The SSMT capability would be developed to translate among all these languages. When technology development is left to the MNCs, they choose to develop technology only in those languages for which there is a market need.

As a result, almost one third of the languages are left out completely. Here, we would cover all 22 official Indian languages.

What kind of technology was decided to be developed?

It was decided to develop AI models for spoken language translation. This included developing automatic speech recognition (ASR or speech to text) models, text to text machine translation (MT), and text to speech (TTS) models.

These technologies when put in a pipeline would give the SSMT system.

The pipeline would also contain, as needed, ancillary models for disfluency (breaks in speech) correction, named entity recognition, lip synchronisation, etc.

Parts of the pipeline would also be usable as a standalone MT system for text to text translation, or a transcription system for speech.

Human intervention would also be possible at every stage.

Such intervention would be important for correcting errors in recordings, though not in online live use.

This basic technology would also open up the market for tools and support applications of various kinds, such as summarisation, LLMs (which have come later), sentiment analysis.

It was also decided to build OCR technology for recognition of Indian language text from images.

The above SSMT pipeline would be built for all 22 official languages of India, and go even beyond these languages later.

If the technology is developed within the country, one has full control over it and one can put it to myriads of uses.


What were some of the strategic elements in the design of the Mission?

A question arose as to how the technology built indigenously ('Made in India') can compete with those developed by MNC tech giants like Google, Microsoft and Meta.

They have the Indian language data (hundred times more than what we possess), the compute resources, and have captured the market as well.

When AI systems are tested under standard artificial benchmarks, they perform very differently compared to use in real life situations. Each 'real life area' is a niche area.

The mission should have a mechanism for supporting the niche areas.

For each domain or application area, the mission should be able to help enhance technology and nurture startups.

The support would be provided through 'Technology Acceleration Centres'.

 

 

Startups in these areas can compete and win against MNC tech giants. These ideas were built into the mission document.

On the question of building strong research teams, the idea of 'consortium' of academic institutions was used to build critical mass of researchers in a project.

Language technology area needs computer scientists, linguists, Sanskrit grammarians, and also language experts of the concerned languages, all working together.

A single institution usually lacks the required expertise. The 13 approved consortia included 70+ research groups located in 30+ institutions covering 22 Indian languages.

It was possible to run such a large distributed mission, only because of the consortia approach.

Even though at times the accounting software in Meity and other ministries is making it very difficult for consortia projects to work.

This method of work has been crucial for progress.

On the issue of data, it was clear that a large amount of money would have to be budgeted for the creation of data for all 22 official Indian languages.

This would mean capturing spoken data and its transcription for all the languages, and parallel sentences in the original language and its translation.

It was also felt that to make the data freely available to Indian researchers and Indian startups, the data would be made open and freely downloadable by anybody.

Ironically this openness in the project would mean that this high quality data would be available to MNC tech giants also, for free.

They have much larger amount of data but of poor quality, gleaned from their users or from the Internet.

Therefore, I had reservations on this count, but mechanisms to restrict distribution of data do not work; they end up denying it to Indian researchers and startups.

Not only the data, the models were also made open source and freely downloadable by anyone.

The goal of Bhashini was not just to deliver a technology, but to build an eco-system for language technology, with all these elements.

What were the elements of the eco-system of language translation that were identified?

The eco-system that Bhashini seeks to develop consists of R&D groups, data creation and collection groups, technology acceleration centres, mechanisms for technology transfer, incubation of startups, participation by other companies, state governments, and the users including publishers, course-ware developers, government departments, end users, etc.

This eco-system would be nurtured by Meity using Bhashini funds.

One can think of them as being a part of three different cycles in society: a. technology cycle; b. market cycle, and c. social cycle.

Each of the cycles had to be made active, and moreover, these cycles have to be mutually reinforcing.

What are these cycles? Can you explain.

The first cycle is the technology cycle. It provides linkages between R&D and startups. R&D does research, finds new ways of doing things, builds lab prototypes to field prototypes - leading to new technology development and its demonstration.

Startups and existing companies take this technology, convert it into products, and service the customers.

However, for the technology to be transferred to companies, the technology has to be 'engineered' for robustness, ruggededness, and adaptation to needs of real life customers.

This task needs to be done by a separate entity, call it technology accceleration centres (TACs).

They have to connect with startups and help them solve problems which come in the way of adaptation of 'new' technologies.

This is called a cycle because there is a two way flow between the two.

The second cycle is the market cycle. It involves, for example, the content providers such as publishers giving services to their customers or end users.

However, they need modern translation tools as well as other AI tools, to make their tasks easier.

This is where technology based startups come in, in making the content in multiple languages or provide new kinds of services, including voicebots.

These help the providers in reaching their end users.

The third is the social cycle. The task here is to get a large number of people into creating Indian language digital content -- both original and translated, proliferation of use of language tools, contributing to languages through teaching, contests, games, etc.

The principle actors here are schools, colleges, language departments and academies, culture departments, students, state governments, and general masses.

This cycle yields love for culture and languages, encourages language aware and digitally trained manpower including e-translators, and of course, precious data.

Linking with state governments is an important step in this cycle.

These cycles are driven by their inner dynamics. Technology cycle is driven by knowledge, market cycle by money, and the social cycle by service.

In the mission, major progress has been made currently in the development of technology, and some progress with central government or its ministries as user.

The market cycle and the social cycle need to be specially energised, as they are much delayed.

What are the outcomes of the Bhashini Mission so far?

Mission Bhashini has led to the development of a range of technologies for SSMT (speech to speech machine translation) for Indian languages.

These technologies have been made ready not just as a lab or field prototype, but engineered for large scale use.

These are the result of R&D and good engineering. OCR technology is also under development.

The above technologies are available in 20+ Indian languages, with 350+ different AI models.

The Bhashini app has been available for free download for some time and provides basic services over mobile phone.

A large number of government ministries are using these technologies, provided as a free service by Meity.

Many of these are as voicebots to assist the users in availing online services, including enquiries about government schemes, filling forms, etc.

Bhashini technology has been used in translating lectures and course material for higher education available on the NPTEL and Swayam platforms.

This has been accomplished by video to video translation of lectures from original English into some 8 Indian languages.

More than 200 courses have been translated. Subtitling facility has also been made available.

More languages and courses are being covered as an ongoing activity.

Open sourcing of data and models, has allowed Indian language data to be used by a large number of individuals, institutions, and startups as free downloads.

What needs to be done now is to develop the market cycle by nurturing startups. They really need to provide services to their customers; governments or private.

Various sectors are waiting to be exploited, such as health, agriculture, school education, etc.

Technology acceleration centres planned under the mission can go a long way in energising the startups in the eco-system.

R&D needs to continue the exploration of completely new ways of building technology which can handle prosody in speech processing, and discourse in machine translation.

What is prosody in speech?

Prosody in speech refers to the rhythm, stress, and intonation of spoken language, or the 'music' of speech, and it conveys meaning and emotional nuance beyond individual words.

It uses features like pitch, loudness, and duration to signal things such as a statement versus a question, the speaker's emotions, sarcasm, or emphasis on certain words.

In future, these systems would utilise features from prosody such as tonal changes, pauses, emphasis, and sentiments in Indian languages.

Similarly, MT would not be limited to sentence to sentence translation at a time but do paragraph to paragraph translation. Indian academia is well poised to do it.

Finally, the social cycle needs to be jump started. It would mean involving the common man in building content for their languages, become adapt in using translation tools under Bhashini, and finally become the creator of original content in Indian languages.

State governments can play a major role in this. A revolution in Indian languages is waiting to be unleashed.

SHIVANAND KANAVI